Warehouse: Managing Your Datasets
The Warehouse is where you create and manage datasets, organized in folders. Datasets are SQL-based transformations of your source data that power your dashboards and reports.
Step 1: Create a New Dataset
- Navigate to the Warehouse in the left sidebar
- Click Create Dataset
- Enter a name for your dataset
- Enter a SQL view name (must be SQL-compatible, no spaces)
- Select a folder to organize it in (or create a new folder)
- Click Create
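The exact rules for a SQL view name are not spelled out here beyond "SQL-compatible, no spaces". As a rough sketch, the check below assumes a common identifier rule (letters, digits, and underscores, not starting with a digit). The pattern and both helper functions are hypothetical illustrations, not the platform's own validation.

```python
import re

# Assumed rule (not confirmed by the platform): a view name starts with a
# letter or underscore and contains only letters, digits, and underscores.
VIEW_NAME_PATTERN = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def is_valid_view_name(name: str) -> bool:
    """Return True if the name matches the assumed identifier rule."""
    return VIEW_NAME_PATTERN.fullmatch(name) is not None


def suggest_view_name(display_name: str) -> str:
    """Derive a candidate view name from a dataset's display name."""
    # Collapse every run of disallowed characters into a single underscore.
    candidate = re.sub(r"[^A-Za-z0-9_]+", "_", display_name.strip())
    candidate = candidate.strip("_").lower()
    # Prefix names that are empty or start with a digit.
    if not candidate or candidate[0].isdigit():
        candidate = "v_" + candidate
    return candidate


print(is_valid_view_name("monthly_revenue"))   # valid under the assumed rule
print(is_valid_view_name("monthly revenue"))   # invalid: contains a space
print(suggest_view_name("Monthly Revenue"))
```

A dataset named "Monthly Revenue" could therefore use a view name such as `monthly_revenue`.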
Step 2: Write Your SQL Query
- Open your new dataset
- Use the built-in SQL editor (powered by Monaco Editor) with syntax highlighting and auto-complete
- Write your SQL to select, join, filter, and aggregate data from your source tables
- Click Preview to see results before committing
- Review the output to make sure it looks correct
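To illustrate the kind of query this step describes, here is a minimal sketch that joins, filters, and aggregates two source tables. The tables `orders` and `customers`, their columns, and the sample rows are hypothetical, and the query runs against an in-memory SQLite database only so that the example is self-contained. The SQL dialect your Warehouse accepts may differ.

```python
import sqlite3

# Hypothetical source tables; replace them with tables from your workspace.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, region TEXT);
CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER,
                     status TEXT, amount REAL);
INSERT INTO customers VALUES (1, 'EMEA'), (2, 'APAC');
INSERT INTO orders VALUES
    (10, 1, 'completed', 120.0),
    (11, 1, 'cancelled', 80.0),
    (12, 2, 'completed', 50.0),
    (13, 2, 'completed', 25.0);
""")

# The dataset query: join orders to customers, keep completed orders,
# and aggregate the order count and total amount per region.
DATASET_SQL = """
SELECT c.region,
       COUNT(*)      AS order_count,
       SUM(o.amount) AS total_amount
FROM orders AS o
JOIN customers AS c ON c.customer_id = o.customer_id
WHERE o.status = 'completed'
GROUP BY c.region
ORDER BY c.region
"""

rows = conn.execute(DATASET_SQL).fetchall()
for row in rows:
    print(row)
```

In the editor, only the contents of `DATASET_SQL` would go into the dataset. Checking a small result like this by hand is the same review you do with Preview before deploying.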
Step 3: Deploy the Query
- Once your SQL is ready, click Deploy
- The platform creates a new version of the dataset. Each version has one of the following statuses:
  - DEPLOYED: Currently active and running on schedule
  - UNDEPLOYED: Saved but not active
  - FAILED: Deployment encountered an error
  - EDITED: Modified since last deployment
- You can roll back to any previous version at any time
Step 4: Set a Refresh Schedule
- Open the dataset settings
- Choose a trigger type:
| Trigger Type | Description |
|---|---|
| Manual | Run on demand when you need fresh data |
| Daily | Runs once every day |
| Weekly | Runs once a week on the day you pick |
| Monthly | Runs once a month on the day of the month you pick (1-31) |
| Cron expression | Custom timing for advanced scheduling |
- Preview the next run time in your timezone
- Save the schedule
The platform detects maintenance windows to prevent scheduling conflicts.
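For the Cron expression trigger, the sketch below lists a few expressions in standard five-field cron syntax (minute, hour, day of month, month, day of week). Whether the platform uses exactly this dialect is an assumption, so confirm each expression with the next-run preview before saving. The `split_cron` helper is a hypothetical illustration that only checks the field count.

```python
# Assumption: standard five-field cron syntax
# (minute hour day-of-month month day-of-week).
EQUIVALENT_CRON = {
    "daily at 06:00": "0 6 * * *",
    "weekly on Monday at 06:00": "0 6 * * 1",
    "monthly on day 1 at 06:00": "0 6 1 * *",
    "every 15 minutes on weekdays": "*/15 * * * 1-5",
}

FIELD_NAMES = ("minute", "hour", "day_of_month", "month", "day_of_week")


def split_cron(expression: str) -> dict[str, str]:
    """Split a five-field cron expression into named fields."""
    fields = expression.split()
    if len(fields) != len(FIELD_NAMES):
        raise ValueError(
            f"expected {len(FIELD_NAMES)} fields, got {len(fields)}: {expression!r}"
        )
    return dict(zip(FIELD_NAMES, fields))


for label, expression in EQUIVALENT_CRON.items():
    print(label, "->", split_cron(expression))
```

In standard cron, an expression pinned to day 31 fires only in months that have 31 days. If the platform follows that convention, a schedule meant to run at every month end needs a different approach.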
Step 5: Add Data Quality Tests (Optional)
- Open your dataset and go to the Tests tab
- Click Create Test
- Write a SQL assertion that should return zero rows if the data is valid
- Choose the test severity:
  - BLOCKER: Prevents deployment of bad data. If the test fails, the dataset will not update.
  - WARNING: Flags potential issues without blocking the refresh.
- Save the test
Tests run automatically each time the dataset refreshes. You can view consolidated test results and download them for review.
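A SQL assertion selects the rows that violate a rule, so an empty result means the test passes. The sketch below shows that convention and how the two severities could be interpreted. The table `monthly_revenue`, the test names, and the `run_tests` helper are hypothetical, and SQLite stands in for the Warehouse only to keep the example runnable.

```python
import sqlite3

# Hypothetical dataset output containing two deliberate problems:
# a negative total and a missing region.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE monthly_revenue (region TEXT, total_amount REAL);
INSERT INTO monthly_revenue VALUES
    ('EMEA', 120.0), ('APAC', -5.0), (NULL, 40.0);
""")

# Each assertion returns the offending rows; zero rows means the test passes.
TESTS = [
    ("no_negative_totals", "BLOCKER",
     "SELECT * FROM monthly_revenue WHERE total_amount < 0"),
    ("region_not_null", "WARNING",
     "SELECT * FROM monthly_revenue WHERE region IS NULL"),
    ("no_duplicate_regions", "BLOCKER",
     "SELECT region FROM monthly_revenue GROUP BY region HAVING COUNT(*) > 1"),
]


def run_tests(connection, tests):
    """Run each assertion and record whether it returned zero rows."""
    results = []
    for name, severity, sql in tests:
        failing_rows = connection.execute(sql).fetchall()
        results.append({
            "name": name,
            "severity": severity,
            "passed": not failing_rows,
            "failing_rows": len(failing_rows),
        })
    return results


results = run_tests(conn, TESTS)
# A failed BLOCKER stops the update; a failed WARNING is only reported.
blocked = any(r["severity"] == "BLOCKER" and not r["passed"] for r in results)

for r in results:
    print(r)
print("update blocked:", blocked)
```

Here the negative total fails a BLOCKER test, so the update would be held back, while the missing region is only flagged as a WARNING.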
Step 6: Organize with Folders
- In the Warehouse, click Create Folder to add a new folder
- Drag datasets between folders to reorganize
- Use folder hierarchies to group by department, project, or data domain
Step 7: Share a Dataset (Optional)
- Open the dataset you want to share
- Configure sharing to make it available to other workspaces
- Workspaces you share the dataset with can see the results but cannot view or modify the underlying SQL
Downloading Results
- Open any dataset
- Click the Download button
- Choose CSV or XLSX format
- The platform generates the file and provides a secure download link