Datasets

Browse and download crypto market data

Loading datasets...

Data Format & Access

Parquet format, Hive-partitioned, tool-compatible

Apache Parquet

Columnar storage for fast analytics

import duckdb

# Query directly from R2
con = duckdb.connect()
df = con.execute("""
  SELECT * FROM read_parquet(
    's3://quantum-edge/crypto_trades/**'
  )
  WHERE symbol = 'BTCUSDT'
    AND date >= '2025-01-01'
""").df()

Hive Partitioning

Efficient query pruning

data/crypto_trades/
  date=2025-01-01/
    hour=00/
      exchange=binance/
        instrument_type=spot/
          symbol=BTCUSDT/
            part-*.parquet

Query only the partitions you need - scan less data, get results faster

Compatible Tools

DuckDB
Polars
Pandas
Spark