NSF-Funded Cyberinfrastructure

Distributed Shared
Tiered Log Store

Activity data describe things that happen rather than things that are.

ChronoLog is a distributed log storage ecosystem that uses physical time as the ordering mechanism - eliminating centralized sequencers and enabling auto-tiered storage across multiple layers. Built to capture the velocity and variety of modern activity data: from scientific instruments producing terabytes per second to AI agent audit trails.

NSF CSSI FundedC++ & Python APIsPlugin Ecosystem

MCP for AI AgentsDocker AvailableBSD-2 License

Explore the Docs GitHub

Collaborators & Partners

What Makes ChronoLog Different

Traditional distributed logs rely on centralized sequencers and consensus protocols to order events. ChronoLog takes a fundamentally different approach.

Physical Time Ordering

Uses synchronized physical clocks instead of Paxos/Raft consensus to order events. No centralized sequencer bottleneck.

Lock-Free Concurrent Writes

Multiple distributed writers append simultaneously without coordination. I/O paths are fully isolated between processes.

Auto-Tiered Storage

Data moves automatically across hot, warm, and cold storage tiers. Elastic capacity without manual data management.

Immediate Visibility

Events are visible to readers as soon as they are written. Total ordering with concurrent access at scale.

Learn more

How It Works

ChronoLog separates ingestion, ordering, persistence, and replay into five specialized services that form a tiered pipeline.

ChronoVisor

Coordination & clock synchronization

ChronoKeeper

Fast ingestion via RDMA (hot tier)

ChronoGrapher

Event assembly & story building (warm tier)

ChronoStore

Persistent archival in HDF5 (cold tier)

ChronoPlayer

Cross-tier reads & replay

Learn more

An Extensible Platform

ChronoLog is designed as a foundation that other systems build upon. Its plugin architecture supports query, streaming, storage, and AI integration workloads.

SQL Query

Familiar SQL semantics over time-ordered log data.

Pub/Sub & Streaming

Real-time event streaming via ChronoGrapher's DAG pipeline.

Key-Value Store

Time-series KV semantics with built-in ordering guarantees.

ML & Training

TensorFlow integration for time-ordered data pipelines.

AI Agent Memory

MCP server for persistent agent memory and audit trails.

Learn more

Project at a Glance

ChronoLog is a multi-year research effort backed by the National Science Foundation and developed in partnership with DOE national laboratories and universities.

$4M

NSF CSSI grant funding

Peer-reviewed publications at top-tier venues (SC, MSST, CLUSTER, IPDPS, ICS)

Years of active development since 2020

10+

Partner institutions including DOE national labs, universities, and industry

Deployed at national laboratory partner sites and university clusters

Taught in the Classroom

Used as teaching infrastructure in distributed systems and HPC graduate courses at IIT. Students learn real-world distributed log design using ChronoLog's APIs and deployment tools.

Cross-Disciplinary Research

Beyond HPC, ChronoLog has been applied in nutrition analysis (with IIT's Department of Food Science) and genomic sequencing pipelines, demonstrating its versatility across domains.

Deployed on Clusters

Active installations on the GRC research cluster and available at DOE partner sites. Used for continuous research workloads including HPC system monitoring and provenance tracking.

Learn more

Get Involved

ChronoLog is open source and welcoming collaborators. Whether you're a researcher exploring distributed log abstractions, a developer building on the plugin ecosystem, or an institution looking for scalable logging infrastructure -- we'd like to hear from you.

Documentation Community (Zulip) GitHub

Supported by a $4M National Science Foundation CSSI Grant

NSF CSSI-2104013

Distributed Shared Tiered Log Store

What Makes ChronoLog Different

Physical Time Ordering

Lock-Free Concurrent Writes

Auto-Tiered Storage

Immediate Visibility

How It Works

An Extensible Platform

SQL Query

Pub/Sub & Streaming

Key-Value Store

ML & Training

AI Agent Memory

Project at a Glance

Taught in the Classroom

Cross-Disciplinary Research

Deployed on Clusters

Get Involved

Distributed Shared
Tiered Log Store