Data Sources

Configure connections to external databases and data sources for seamless integration with your existing data infrastructure.

Database Connection Configuration

Snowflake Data Warehouse

To connect to your Snowflake data warehouse:

1. Go to Settings → Data Sources.
2. Select Add New Connection.
3. Enter your database details:
   - Account Identifier: Your Snowflake account name
   - Username: Your Snowflake username
   - Password: Your Snowflake password (securely encrypted)
   - Warehouse: Default warehouse for queries
   - Database: Default database to connect to
   - Schema: Default schema within the database
4. Select Test Connection to verify your credentials.
5. Save the connection for future use.
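
The fields above can be checked for completeness before running a connection test. The following is a minimal sketch, not the application's own code: `SnowflakeSettings` and `missing_fields` are hypothetical names, the `PUBLIC` default is an assumption made for illustration, and the values are the placeholders used elsewhere in this guide.

```python
from dataclasses import dataclass, fields


@dataclass
class SnowflakeSettings:
    """Hypothetical container mirroring the fields on the connection form."""
    account: str
    username: str
    password: str
    warehouse: str
    database: str
    schema: str = "PUBLIC"  # assumed default, for illustration only


def missing_fields(settings: SnowflakeSettings) -> list[str]:
    """Return the names of fields that are empty or whitespace-only."""
    return [f.name for f in fields(settings)
            if not str(getattr(settings, f.name)).strip()]


settings = SnowflakeSettings(
    account="your-company.snowflakecomputing.com",
    username="your_username",
    password="",  # left blank on purpose to show the check
    warehouse="COMPUTE_WH",
    database="ANALYTICS_DB",
)
print(missing_fields(settings))  # ['password']
```

Catching an empty field locally gives a clearer message than a failed login against the warehouse.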

PostgreSQL Configuration

For PostgreSQL database connections:

Required Information

Host: Database server address
Port: Database port (typically 5432)
Database: Database name
Username: Database username
Password: Database password

Optional Settings

SSL Mode: Connection security settings
Connection Timeout: Time limit for establishing a connection
Schema: Default schema to use
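
These settings correspond closely to a standard PostgreSQL connection URI. The sketch below assumes the libpq URI format, where `sslmode` and `connect_timeout` are the libpq parameter names. `build_postgres_uri` is a hypothetical helper, and how the application stores these settings internally is not described in this guide. The default schema is left out of the sketch.

```python
from urllib.parse import quote, urlencode


def build_postgres_uri(host, database, username, password,
                       port=5432, sslmode=None, connect_timeout=None):
    """Assemble a libpq-style URI (hypothetical helper for illustration)."""
    params = {}
    if sslmode is not None:
        params["sslmode"] = sslmode
    if connect_timeout is not None:
        params["connect_timeout"] = str(connect_timeout)  # seconds
    # Percent-encode credentials so characters like '@' or ':' stay unambiguous.
    auth = f"{quote(username, safe='')}:{quote(password, safe='')}"
    uri = f"postgresql://{auth}@{host}:{port}/{quote(database, safe='')}"
    return f"{uri}?{urlencode(params)}" if params else uri


uri = build_postgres_uri("analytics-db.company.com", "analytics",
                         "analyst", "p@ss:word", sslmode="require")
print(uri)
# postgresql://analyst:p%40ss%3Aword@analytics-db.company.com:5432/analytics?sslmode=require
```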

Secure Storage

All credentials are encrypted and stored locally on your machine. Nothing is sent to external servers except the API calls you explicitly authorize.

Connection Examples

Snowflake Connection Example

Account Identifier: your-company.snowflakecomputing.com
Username: your_username
Password: your_secure_password
Warehouse: COMPUTE_WH
Database: ANALYTICS_DB
Schema: PUBLIC
Role: ANALYST_ROLE

PostgreSQL Connection Example

Host: analytics-db.company.com
Port: 5432
Database: analytics
Username: analyst
Password: secure_password
SSL Mode: require

Local Database Example

Host: localhost
Port: 5432
Database: local_analytics
Username: postgres
Password: local_password
SSL Mode: prefer
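
The two PostgreSQL examples differ in SSL Mode: `require` for the remote server and `prefer` for the local one. Assuming the setting follows libpq's `sslmode` semantics (an assumption, since this guide does not say so), the difference is whether the client may fall back to an unencrypted connection. `guarantees_encryption` is a hypothetical helper that sketches the distinction.

```python
# libpq sslmode values, ordered from least to most strict.
SSL_MODES = ["disable", "allow", "prefer", "require", "verify-ca", "verify-full"]


def guarantees_encryption(sslmode: str) -> bool:
    """True when the mode refuses to fall back to an unencrypted connection."""
    if sslmode not in SSL_MODES:
        raise ValueError(f"unknown sslmode: {sslmode!r}")
    return SSL_MODES.index(sslmode) >= SSL_MODES.index("require")


print(guarantees_encryption("require"))  # True
print(guarantees_encryption("prefer"))   # False
```

Under that assumption, `prefer` is reasonable for a database on `localhost`, while a remote server calls for `require` or one of the certificate-verifying modes.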

Security and Privacy

Data Privacy

Local Processing

All queries execute directly against your database
No data is transmitted to third-party servers
Results are processed entirely on your local machine
Local processing supports compliance with data privacy regulations

Credential Security

Passwords encrypted using industry-standard encryption
Credentials stored only on your local machine
Support for password-less authentication methods
Integration with enterprise identity providers

Network Security

Connection Security

SSL/TLS encryption for all database connections
Support for VPN and private network connections
Firewall-friendly connection pooling
Certificate-based authentication where supported
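
As an illustration of what an encrypted, certificate-aware connection involves on the client side, here is a minimal sketch using Python's standard `ssl` module. It is not the application's implementation. `make_tls_context` and its file-path parameters are hypothetical, and the TLS 1.2 floor is an assumed policy.

```python
import ssl


def make_tls_context(ca_file=None, client_cert=None, client_key=None):
    """Build a client TLS context that verifies the server certificate."""
    # create_default_context enables certificate and hostname verification.
    ctx = ssl.create_default_context(cafile=ca_file)
    ctx.minimum_version = ssl.TLSVersion.TLSv1_2  # assumed policy
    if client_cert:
        # Certificate-based authentication: present a client certificate.
        ctx.load_cert_chain(certfile=client_cert, keyfile=client_key)
    return ctx


ctx = make_tls_context()
print(ctx.verify_mode == ssl.CERT_REQUIRED, ctx.check_hostname)  # True True
```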

Access Control

Database-level permissions respected
Row-level security support
Schema and table access controls
Audit logging of database queries
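
Audit logging of queries can be pictured as a thin wrapper around query execution. The sketch below uses an in-memory SQLite database as a stand-in. `audited_execute` and the in-memory `audit_log` list are hypothetical, and the application's actual log format and storage are not documented here.

```python
import sqlite3
from datetime import datetime, timezone

audit_log = []  # stand-in; a real audit trail would use durable storage


def audited_execute(conn, user, sql, params=()):
    """Record who ran which statement, then execute it."""
    # Log before executing so that failed statements are recorded too.
    audit_log.append({
        "at": datetime.now(timezone.utc).isoformat(),
        "user": user,
        "sql": sql,
    })
    return conn.execute(sql, params)


conn = sqlite3.connect(":memory:")
rows = audited_execute(conn, "analyst", "SELECT 1 + 1").fetchall()
print(rows)                  # [(2,)]
print(audit_log[0]["user"])  # analyst
```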

Connection Management

Managing Multiple Connections

Organization Features

Connection Names: Give descriptive names to each connection
Connection Groups: Organize connections by environment or purpose
Default Connections: Set preferred connections for different data types
Connection Status: Monitor connection health and availability

Environment Management

Development: Connect to development databases for testing
Staging: Use staging environments for validation
Production: Secure connections to production data
Backup: Configure backup database connections

Connection Testing and Validation

Verification Process

Test Connection: Verify credentials and network connectivity
Schema Browsing: Preview available tables and schemas
Query Testing: Run test queries to validate permissions
Performance Testing: Check query response times
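
The four verification steps can be sketched as a single routine. This example runs against an in-memory SQLite database as a stand-in, and the catalog query is SQLite-specific. `verify_connection` and the shape of its report are hypothetical rather than the application's own output.

```python
import sqlite3
import time


def verify_connection(conn):
    """Run connectivity, schema, permission, and latency checks."""
    report = {}
    # 1. Test connection: the cheapest possible round trip.
    report["connected"] = conn.execute("SELECT 1").fetchone() == (1,)
    # 2. Schema browsing: list the available tables.
    report["tables"] = [row[0] for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name")]
    # 3. Query testing: confirm each table can actually be read.
    report["readable"] = {}
    for table in report["tables"]:
        try:
            conn.execute(f'SELECT * FROM "{table}" LIMIT 1').fetchall()
            report["readable"][table] = True
        except sqlite3.Error:
            report["readable"][table] = False
    # 4. Performance testing: time a trivial query.
    start = time.perf_counter()
    conn.execute("SELECT 1").fetchone()
    report["latency_ms"] = (time.perf_counter() - start) * 1000
    return report


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, total REAL)")
report = verify_connection(conn)
print(report["tables"], report["readable"])  # ['orders'] {'orders': True}
```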

Health Monitoring

Connection Status: Real-time connection health indicators
Query Performance: Monitor query execution times
Error Tracking: Track and resolve connection issues
Usage Statistics: Monitor database usage patterns

Advanced Configuration

Performance Optimization

Query Optimization

Automatic query pushdown for better performance
Intelligent sampling for large tables
Parallel query execution when possible
Automatic caching of frequently accessed data
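
To make the caching item concrete, the following sketch memoizes query results by SQL text and parameters. It illustrates the general idea only: `CachingRunner` is a hypothetical class, SQLite stands in for the real database, and the sketch has no invalidation, so it would return stale rows if the underlying data changed.

```python
import sqlite3


class CachingRunner:
    """Serve repeated identical queries from memory (no invalidation)."""

    def __init__(self, conn):
        self.conn = conn
        self._cache = {}
        self.hits = 0

    def query(self, sql, params=()):
        key = (sql, tuple(params))
        if key in self._cache:
            self.hits += 1
            return self._cache[key]
        rows = self.conn.execute(sql, params).fetchall()
        self._cache[key] = rows
        return rows


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, total REAL)")
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 9.5), (2, 20.0)])

runner = CachingRunner(conn)
first = runner.query("SELECT SUM(total) FROM orders")
second = runner.query("SELECT SUM(total) FROM orders")  # served from cache
print(first, runner.hits)  # [(29.5,)] 1
```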

Resource Management

Configurable warehouse size recommendations (Snowflake)
Query cost estimation before execution
Automatic query cancellation for runaway operations
Connection pooling for multiple concurrent analyses
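
Connection pooling keeps a fixed set of open connections and lends them out to concurrent analyses. The sketch below shows the pattern with the standard library only. `ConnectionPool` is a hypothetical class, not the application's pool, and SQLite is used as a stand-in.

```python
import queue
import sqlite3
from contextlib import contextmanager


class ConnectionPool:
    """Fixed-size pool: borrow a connection, return it when done."""

    def __init__(self, factory, size=2):
        self._idle = queue.Queue(maxsize=size)
        for _ in range(size):
            self._idle.put(factory())

    @contextmanager
    def connection(self, timeout=5.0):
        # Blocks until a connection is free; raises queue.Empty on timeout.
        conn = self._idle.get(timeout=timeout)
        try:
            yield conn
        finally:
            self._idle.put(conn)  # always hand the connection back


pool = ConnectionPool(lambda: sqlite3.connect(":memory:"), size=2)
with pool.connection() as conn:
    result = conn.execute("SELECT 42").fetchone()
print(result)  # (42,)
```

Capping the pool size also caps the number of simultaneous sessions opened against the database.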

Integration Features

Data Source Discovery

Automatic Schema Detection: Discover tables and relationships
Metadata Import: Import column descriptions and data types
Data Lineage: Track data source dependencies
Change Detection: Monitor schema and data changes
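
One common way to detect schema changes is to fingerprint the table and column definitions and compare fingerprints between runs. The sketch below demonstrates that approach against SQLite. It is an assumed technique for illustration, not a description of how the application detects changes, and `schema_fingerprint` is a hypothetical name.

```python
import hashlib
import sqlite3


def schema_fingerprint(conn):
    """Hash every table's column names and types into one digest."""
    digest = hashlib.sha256()
    tables = [row[0] for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name")]
    for table in tables:
        # PRAGMA table_info rows: (cid, name, type, notnull, default, pk)
        for _cid, name, col_type, *_rest in conn.execute(
                f'PRAGMA table_info("{table}")'):
            digest.update(f"{table}.{name}:{col_type}\n".encode())
    return digest.hexdigest()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, total REAL)")
before = schema_fingerprint(conn)

conn.execute("ALTER TABLE orders ADD COLUMN currency TEXT")
after = schema_fingerprint(conn)
print(before != after)  # True
```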

Synchronization Options

Real-Time Updates: Live connection to changing data
Scheduled Refresh: Periodic data synchronization
Incremental Loading: Efficient updates for large datasets
Change Data Capture: Track and process data changes
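
Incremental loading is often implemented with a high-water mark: remember the largest key already loaded and fetch only rows beyond it. The sketch below shows that pattern on a SQLite stand-in. The `orders` table, its columns, and `load_increment` are hypothetical, and the application's own refresh mechanism may work differently.

```python
import sqlite3


def load_increment(source, watermark):
    """Fetch rows with id above the watermark; return them and the new mark."""
    rows = source.execute(
        "SELECT id, total FROM orders WHERE id > ? ORDER BY id",
        (watermark,),
    ).fetchall()
    new_watermark = rows[-1][0] if rows else watermark
    return rows, new_watermark


source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, total REAL)")
source.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 9.5), (2, 20.0)])

first, mark = load_increment(source, 0)       # initial full load
source.execute("INSERT INTO orders VALUES (3, 7.25)")
second, mark = load_increment(source, mark)   # only the new row
print(second, mark)  # [(3, 7.25)] 3
```

This pattern only picks up appended rows. Updates and deletes need a change timestamp or change data capture.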

File-Based Data Sources

Local File Configuration

Supported Formats

CSV: Automatic delimiter and encoding detection
Parquet: High-performance columnar format
Excel: .xlsx and .xls file support
JSON: Structured data import
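
Delimiter detection for CSV files can be sketched with Python's standard `csv.Sniffer`. This shows the general technique, not the application's own detector. `read_delimited` is a hypothetical helper, the candidate delimiters are an assumption, and encoding detection is not covered.

```python
import csv
import io


def read_delimited(text, candidates=",;\t|"):
    """Guess the delimiter from a sample, then parse the whole text."""
    dialect = csv.Sniffer().sniff(text[:4096], delimiters=candidates)
    rows = list(csv.reader(io.StringIO(text), dialect))
    return dialect.delimiter, rows


sample = "id;name;total\n1;Ada;9.5\n2;Grace;20.0\n"
delimiter, rows = read_delimited(sample)
print(repr(delimiter))  # ';'
print(rows[1])          # ['1', 'Ada', '9.5']
```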

File Handling Options

Watch Folders: Automatically process new files
Batch Processing: Handle multiple files simultaneously
File Validation: Verify data quality before processing
Error Handling: Robust handling of malformed files
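
File validation and error handling can be combined so that malformed rows are set aside and reported instead of aborting the import. The following is a minimal sketch under that assumption. `validate_rows` is a hypothetical helper, and the checks the application actually performs are not listed in this guide.

```python
import csv
import io


def validate_rows(text, expected_columns):
    """Split CSV rows into well-formed rows and (line, message) errors."""
    good, errors = [], []
    reader = csv.reader(io.StringIO(text))
    header = next(reader, None)
    if header != expected_columns:
        return good, [(1, f"unexpected header: {header}")]
    for row in reader:
        if len(row) != len(expected_columns):
            # reader.line_num is the source line the row ended on.
            errors.append((reader.line_num,
                           f"expected {len(expected_columns)} fields, "
                           f"got {len(row)}"))
        else:
            good.append(row)
    return good, errors


text = "id,total\n1,9.5\n2\n3,7.25\n"
good, errors = validate_rows(text, ["id", "total"])
print(good)    # [['1', '9.5'], ['3', '7.25']]
print(errors)  # [(3, 'expected 2 fields, got 1')]
```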

Cloud Storage Integration

Supported Platforms

Amazon S3: Direct connection to S3 buckets
Google Cloud Storage: GCS bucket integration
Azure Blob Storage: Azure storage account connections
Dropbox/Google Drive: Personal cloud storage access

What’s Next?

Performance Guide: Learn how to optimize performance and monitor your data source connections.

Troubleshooting Guide: Resolve common data source connection issues and configuration problems.