Skip to content

Overview

Probably supports the most efficient data formats for fast analysis and visualization. Connect to databases, load files, or generate synthetic data for testing.

File Formats

CSV, Parquet, Feather/Arrow, and compressed formats with optimized loading.

Database Connections

Direct connections to Snowflake with more database connectors coming soon.

Synthetic Data

AI-generated datasets for testing, prototyping, and learning analysis workflows.

No Practical Limits

Small Files
Under 100MB load instantly

Large Files
100MB - 10GB optimized loading

Massive Datasets
10GB+ with streaming processing

  1. Clean Headers: Ensure column names are descriptive and unique
  2. Consistent Formats: Use consistent date formats and number formatting
  3. Missing Values: Probably handles missing data automatically, but clean data works better
  4. File Encoding: UTF-8 encoding is recommended for international characters
  • Use Parquet: Convert large CSVs to Parquet for 10x faster loading
  • Index Your Database: Proper indexing dramatically improves query performance
  • Filter Early: Use database views to pre-filter large datasets
  • Compress Files: Use .gz or .zip compression for faster transfers

Don’t have data to test with? Try these sample datasets:

Sales Sample

Regional sales data with marketing spend, perfect for ROI analysis

Customer Sample

Customer behavior data ideal for churn analysis

File Formats

Learn about supported file formats and how to optimize file loading performance.

Database Connections

Connect to enterprise databases like Snowflake for real-time analysis.