NSF-Funded Cyberinfrastructure

Distributed Shared Tiered Log Store

Activity data describe things that happen rather than things that are.

ChronoLog is a distributed log storage ecosystem that uses physical time as the ordering mechanism, eliminating centralized sequencers and enabling auto-tiered storage across multiple layers. It is built to capture the velocity and variety of modern activity data, from scientific instruments producing terabytes per second to AI agent audit trails.

NSF CSSI Funded | C++ & Python APIs | Plugin Ecosystem
MCP for AI Agents | Docker Available | BSD-2 License

Collaborators & Partners

National Science Foundation, Argonne National Laboratory, Lawrence Livermore National Laboratory, SLAC National Accelerator Laboratory, Illinois Institute of Technology, Gnosis Research Center, University of Chicago, University of Illinois, University of Wisconsin, DePaul University, Omnibond, 3Red Partners, ParaTools, Parsl, Institute for Food Safety and Health

What Makes ChronoLog Different

Traditional distributed logs rely on centralized sequencers and consensus protocols to order events. ChronoLog takes a fundamentally different approach.

Physical Time Ordering

Uses synchronized physical clocks instead of Paxos/Raft consensus to order events. No centralized sequencer bottleneck.
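
To illustrate the idea of ordering by physical time, here is a minimal Python sketch. It is not the ChronoLog API: the `Event` type, the `merge_by_time` helper, and the use of a writer ID as a tie-breaker are all assumptions made for illustration. Each writer stamps its own events from a clock that is assumed to be synchronized, and a total order is obtained by merging on the timestamp alone, with no sequencer involved.

```python
import heapq
from typing import Iterable, Iterator, NamedTuple


class Event(NamedTuple):
    timestamp_ns: int  # physical clock reading at the writer (assumed synchronized)
    writer_id: int     # hypothetical tie-breaker so equal timestamps still order totally
    payload: str


def merge_by_time(*streams: Iterable[Event]) -> Iterator[Event]:
    """Merge per-writer streams (each already time-sorted) into one total order."""
    return heapq.merge(*streams, key=lambda e: (e.timestamp_ns, e.writer_id))


# Two writers append independently; neither asks a sequencer for a position.
writer_a = [Event(100, 0, "a1"), Event(250, 0, "a2")]
writer_b = [Event(120, 1, "b1"), Event(250, 1, "b2")]

ordered = [e.payload for e in merge_by_time(writer_a, writer_b)]
print(ordered)
```

The two events that share timestamp 250 are ordered by writer ID in this sketch; how a real deployment resolves such ties depends on its clock precision and is not described here.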

Lock-Free Concurrent Writes

Multiple distributed writers append simultaneously without coordination. I/O paths are fully isolated between processes.

Auto-Tiered Storage

Data moves automatically across hot, warm, and cold storage tiers. Elastic capacity without manual data management.
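
One simple way to picture automatic tiering is an age-based policy, sketched below in Python. This is an assumption for illustration only: the thresholds and the `tier_for_age` function are hypothetical, and the actual policy ChronoLog uses to move data between tiers is not specified on this page.

```python
# Hypothetical thresholds; real values would depend on the deployment.
HOT_MAX_AGE_S = 60        # events younger than this stay in the hot tier
WARM_MAX_AGE_S = 3600     # events younger than this (but not hot) are warm


def tier_for_age(age_s: float) -> str:
    """Pick a storage tier from an event's age in seconds (illustrative policy)."""
    if age_s < HOT_MAX_AGE_S:
        return "hot"
    if age_s < WARM_MAX_AGE_S:
        return "warm"
    return "cold"


placements = [tier_for_age(age) for age in (5, 600, 86400)]
print(placements)
```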

Immediate Visibility

Events are visible to readers as soon as they are written. Total ordering with concurrent access at scale.

How It Works

ChronoLog separates ingestion, ordering, persistence, and replay into five specialized services that form a tiered pipeline.

ChronoVisor

Coordination & clock synchronization

ChronoKeeper

Fast ingestion via RDMA (hot tier)

ChronoGrapher

Event assembly & story building (warm tier)

ChronoStore

Persistent archival in HDF5 (cold tier)

ChronoPlayer

Cross-tier reads & replay
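
The cross-tier read can be pictured with a small Python sketch. It is a toy model under stated assumptions, not ChronoLog code: tiers are plain in-memory lists, events are `(timestamp_ns, payload)` tuples, and the `replay` function is a hypothetical name. The point is only that a reader asks for a time range and gets one time-ordered result, regardless of which tier holds each event.

```python
from typing import Dict, List, Tuple

Event = Tuple[int, str]  # (timestamp_ns, payload), an illustrative shape


def replay(tiers: Dict[str, List[Event]], start_ns: int, end_ns: int) -> List[Event]:
    """Return events with start_ns <= timestamp < end_ns from every tier, in time order."""
    selected = [
        event
        for events in tiers.values()
        for event in events
        if start_ns <= event[0] < end_ns
    ]
    return sorted(selected)


tiers = {
    "hot": [(900, "newest")],
    "warm": [(500, "recent")],
    "cold": [(100, "archived"), (200, "older")],
}

replayed = replay(tiers, 150, 1000)
print(replayed)
```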

Project at a Glance

ChronoLog is a multi-year research effort backed by the National Science Foundation and developed in partnership with DOE national laboratories and universities.

$4M

NSF CSSI grant funding

5

Peer-reviewed publications at top-tier venues (SC, MSST, CLUSTER, IPDPS, ICS)

4+

Years of active development since 2020

10+

Partner institutions including DOE national labs, universities, and industry

Deployed at national laboratory partner sites and university clusters

Taught in the Classroom

Used as teaching infrastructure in distributed systems and HPC graduate courses at IIT. Students learn real-world distributed log design using ChronoLog's APIs and deployment tools.

Cross-Disciplinary Research

Beyond HPC, ChronoLog has been applied in nutrition analysis (with IIT's Department of Food Science) and genomic sequencing pipelines, demonstrating its versatility across domains.

Deployed on Clusters

Active installations on the GRC research cluster and available at DOE partner sites. Used for continuous research workloads including HPC system monitoring and provenance tracking.

Get Involved

ChronoLog is open source and welcoming collaborators. Whether you're a researcher exploring distributed log abstractions, a developer building on the plugin ecosystem, or an institution looking for scalable logging infrastructure -- we'd like to hear from you.

Supported by a $4M National Science Foundation CSSI Grant

NSF CSSI-2104013

Gnosis Research Center