✓ Used by 200+ Data Scientists

Production-Ready ML Datasets

Skip months of data cleaning. Our pre-validated Parquet datasets are ready for your ML pipelines, feature engineering, and model training.

Get 40% OFF This Week → Browse on HuggingFace

Perfect For Your ML Use Cases

🎯 Supervised Learning

Pre-labeled business datasets with clear target variables. Risk scores, classification labels, regression targets included.

📊 Time Series Forecasting

Daily/monthly time series data spanning 5+ years. Perfect for ARIMA, Prophet, LSTM models. No gaps.

🔗 Multi-Source Fusion

Weather × Economy, Health × Climate correlations. Unique multi-source datasets unavailable elsewhere.

🏗️ Feature Engineering

Pre-computed features reduce prep time by 80%. Rolling averages, lags, normalized scores ready to use.

Top ML-Ready Datasets

🏦 EU B2B Risk Intelligence

13,398 rows • 48 features • Perfect for credit scoring models

View →

🏠 France Real Estate (DVF)

1.17M rows • 38 features • Price prediction, market analysis

View →

🇫🇷 France Business Intelligence

105,909 rows • 59 features • Lead scoring, churn prediction

View →

🌡️ Weather × Health Correlations

15,645 rows • Cross-domain ML, epidemiology modeling

View →

📈 Trading Quant Signals

932 rows • 74 indicators • VIX regime, FX-beta features

View →

Load in 3 Lines

# Load Tiger Data in Python
import polars as pl

# Direct load from Parquet
df = pl.read_parquet("tiger-data-france_real_estate.parquet")

# 1.17M rows ready for ML
print(df.shape)  # (1168684, 38)

# Your model in minutes, not months
X = df.select(["surface", "code_postal", "type_local"])
y = df["valeur_fonciere"]

Launch Week: 40% OFF All Datasets

Use code LAUNCH40 • Ends Sunday • No hidden fees

Claim Your Discount →