Skip to content

File Formats

Probably supports multiple file formats optimized for different use cases. Choose the right format for your data size and performance needs.

Best for: Small to medium datasets, data exchange, human-readable format

Most Common Format

  • Universal compatibility with all tools
  • Human-readable and easy to edit
  • Works well for datasets under 100MB
  • Fast loading and processing

CSV Features in Probably

  • Smart type inference for columns
  • Handles quoted fields and escaped characters
  • UTF-8 encoding for text data
  • Chunked upload for large files

Best for: Large datasets, analytical workloads, cloud storage

High-Performance Choice

  • 10-20x faster loading than CSV
  • Columnar storage for analytical queries
  • Built-in compression reduces file size
  • Preserves data types and metadata

Why Choose Parquet

  • Speed: Columnar format optimized for analytics
  • Compression: Typically 50-80% smaller than CSV
  • Type Safety: Preserves exact data types
  • Schema Evolution: Supports adding/removing columns
  • Cross-Platform: Works across Python, R, Spark, and more

Best for: Maximum performance, memory efficiency, cross-language compatibility

Fastest Format

  • Fastest possible loading speeds
  • Zero-copy memory mapping
  • Perfect type preservation
  • Cross-language standard (Python, R, JavaScript)

Simply drag files into Probably for instant loading:

  • Single Files: Drop individual files anywhere
  • Multiple Files: Select and drop multiple files at once
  • Folders: Drop entire folders (automatically detects compatible files)
  • Compressed Files: .zip and .gz files are automatically extracted

Use the file picker for more control:

  • Browse to specific file locations
  • Filter by file type
  • Preview file contents before loading
  • Batch selection of multiple files
FormatLoad SpeedFile SizeBest Use Case
CSVBaselineLargestSmall data, compatibility
Parquet10-20x faster50-80% smallerLarge data, analytics
Feather/Arrow25-50x faster20-40% smallerRepeated access, speed

Database Connections

Connect to enterprise databases like Snowflake for real-time analysis.

Data Preparation

Learn how to prepare and optimize your data for analysis.