Data Sources
Configure connections to external databases and data sources for seamless integration with your existing data infrastructure.
Database Connection Configuration
Section titled “Database Connection Configuration”Snowflake Data Warehouse
Section titled “Snowflake Data Warehouse”For connecting to your Snowflake data warehouse:
- Go to Settings → Data Sources
- Add New Connection
- Enter your database details:
- Account Identifier: Your Snowflake account name
- Username: Your Snowflake username
- Password: Your Snowflake password (securely encrypted)
- Warehouse: Default warehouse for queries
- Database: Default database to connect to
- Schema: Default schema within the database
- Test Connection to verify credentials
- Save for future use
PostgreSQL Configuration
Section titled “PostgreSQL Configuration”For PostgreSQL database connections:
Required Information
- Host: Database server address
- Port: Database port (typically 5432)
- Database: Database name
- Username: Database username
- Password: Database password
Optional Settings
- SSL Mode: Connection security settings
- Connection Timeout: Query timeout settings
- Schema: Default schema to use
Secure Storage
All credentials are encrypted and stored locally on your machine. Nothing is sent to external servers except for the specific API calls you authorize.
Connection Examples
Section titled “Connection Examples”Snowflake Connection Example
Section titled “Snowflake Connection Example”Account: your-company.snowflakecomputing.comUsername: your_usernamePassword: your_secure_passwordWarehouse: COMPUTE_WHDatabase: ANALYTICS_DBSchema: PUBLICRole: ANALYST_ROLEPostgreSQL Connection Example
Section titled “PostgreSQL Connection Example”Host: analytics-db.company.comPort: 5432Database: analyticsUsername: analystPassword: secure_passwordSSL Mode: requireLocal Database Example
Section titled “Local Database Example”Host: localhostPort: 5432Database: local_analyticsUsername: postgresPassword: local_passwordSSL Mode: preferSecurity and Privacy
Section titled “Security and Privacy”Data Privacy
Section titled “Data Privacy”Local Processing
- All queries execute directly against your database
- No data is transmitted to external servers
- Results are processed entirely on your local machine
- Full compliance with data privacy regulations
Credential Security
- Passwords encrypted using industry-standard encryption
- Credentials stored only on your local machine
- Support for password-less authentication methods
- Integration with enterprise identity providers
Network Security
Section titled “Network Security”Connection Security
- SSL/TLS encryption for all database connections
- Support for VPN and private network connections
- Firewall-friendly connection pooling
- Certificate-based authentication where supported
Access Control
- Database-level permissions respected
- Row-level security support
- Schema and table access controls
- Audit logging of database queries
Connection Management
Section titled “Connection Management”Managing Multiple Connections
Section titled “Managing Multiple Connections”Organization Features
- Connection Names: Give descriptive names to each connection
- Connection Groups: Organize connections by environment or purpose
- Default Connections: Set preferred connections for different data types
- Connection Status: Monitor connection health and availability
Environment Management
- Development: Connect to development databases for testing
- Staging: Use staging environments for validation
- Production: Secure connections to production data
- Backup: Configure backup database connections
Connection Testing and Validation
Section titled “Connection Testing and Validation”Verification Process
- Test Connection: Verify credentials and network connectivity
- Schema Browsing: Preview available tables and schemas
- Query Testing: Run test queries to validate permissions
- Performance Testing: Check query response times
Health Monitoring
- Connection Status: Real-time connection health indicators
- Query Performance: Monitor query execution times
- Error Tracking: Track and resolve connection issues
- Usage Statistics: Monitor database usage patterns
Advanced Configuration
Section titled “Advanced Configuration”Performance Optimization
Section titled “Performance Optimization”Query Optimization
- Automatic query pushdown for better performance
- Intelligent sampling for large tables
- Parallel query execution when possible
- Automatic caching of frequently accessed data
Resource Management
- Configurable warehouse size recommendations (Snowflake)
- Query cost estimation before execution
- Automatic query cancellation for runaway operations
- Connection pooling for multiple concurrent analyses
Integration Features
Section titled “Integration Features”Data Source Discovery
- Automatic Schema Detection: Discover tables and relationships
- Metadata Import: Import column descriptions and data types
- Data Lineage: Track data source dependencies
- Change Detection: Monitor schema and data changes
Synchronization Options
- Real-Time Updates: Live connection to changing data
- Scheduled Refresh: Periodic data synchronization
- Incremental Loading: Efficient updates for large datasets
- Change Data Capture: Track and process data changes
File-Based Data Sources
Section titled “File-Based Data Sources”Local File Configuration
Section titled “Local File Configuration”Supported Formats
- CSV: Automatic delimiter and encoding detection
- Parquet: High-performance columnar format
- Excel: .xlsx and .xls file support
- JSON: Structured data import
File Handling Options
- Watch Folders: Automatically process new files
- Batch Processing: Handle multiple files simultaneously
- File Validation: Verify data quality before processing
- Error Handling: Robust handling of malformed files
Cloud Storage Integration
Section titled “Cloud Storage Integration”Supported Platforms
- Amazon S3: Direct connection to S3 buckets
- Google Cloud Storage: GCS bucket integration
- Azure Blob Storage: Azure storage account connections
- Dropbox/Google Drive: Personal cloud storage access
What’s Next?
Section titled “What’s Next?”Performance Optimization
Learn how to optimize performance and monitor your data source connections.
Troubleshooting
Resolve common data source connection issues and configuration problems.