Running Datoria

This section covers the operational side of Datoria: how to run your data platform from development through testing to production deployment. We'll walk through the workflow stages that turn your code into a running data pipeline.

Development Workflow Overview

Building and maintaining a data platform with Datoria follows a structured workflow:

1. Define Your Data Platform

Start by defining your tables, transformations, and dependencies in code. The Virtual Data Platform validates your definitions in real time, catching issues before execution.

2. Test with Unit Tests

Validate your transformations with unit tests that run against the real database engine. These tests provide confidence that your code behaves as expected.
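To make the idea of testing a transformation against a real engine concrete, here is a minimal sketch. It does not use Datoria's UnitTest framework, whose API is not shown in this section. As an assumption for illustration, Python's built-in sqlite3 module stands in for the database engine, and the `orders` table, the SQL, and the function names are all hypothetical.

```python
import sqlite3

# Hypothetical transformation under test: total order amount per customer.
TRANSFORMATION_SQL = """
SELECT customer_id, SUM(amount) AS total_amount
FROM orders
GROUP BY customer_id
ORDER BY customer_id
"""


def run_transformation(rows):
    """Load input rows into an in-memory database and run the transformation."""
    conn = sqlite3.connect(":memory:")  # stand-in engine, not Datoria's
    try:
        conn.execute("CREATE TABLE orders (customer_id INTEGER, amount INTEGER)")
        conn.executemany("INSERT INTO orders VALUES (?, ?)", rows)
        return conn.execute(TRANSFORMATION_SQL).fetchall()
    finally:
        conn.close()


def test_totals_per_customer():
    # Small, hand-written input with a known expected result.
    result = run_transformation([(1, 10), (1, 5), (2, 7)])
    assert result == [(1, 15), (2, 7)]


test_totals_per_customer()
```

The point of the pattern is that the SQL is executed by an actual engine rather than mocked, so syntax errors and aggregation mistakes surface in the test itself.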

3. Run in Sandboxes

Test your transformations in isolated environments that don't affect shared resources. Sandboxes allow you to work with real data while keeping your changes contained.

4. Deploy to Dev/Stage

Once you're confident in your changes, deploy them to a shared development or staging environment for broader testing and stakeholder review.

5. Migrate to Production

Apply your changes to production schemas with Datoria's migration system, which ensures backward compatibility and prevents breaking changes.
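As a rough illustration of what a backward-compatibility check can look like, the sketch below compares two versions of a table schema. The rules it applies (adding a column is safe; removing a column or changing its type is breaking) are an assumption made for this example, not a description of Datoria's migration system. The `breaking_changes` function and the column names are hypothetical.

```python
def breaking_changes(old: dict[str, str], new: dict[str, str]) -> list[str]:
    """Return the changes that would break readers of the old schema.

    Schemas are modelled as {column_name: type_name}. Assumed rules:
    new columns are allowed, removed or retyped columns are breaking.
    """
    problems = []
    for column, old_type in old.items():
        if column not in new:
            problems.append(f"column removed: {column}")
        elif new[column] != old_type:
            problems.append(f"type changed: {column} {old_type} -> {new[column]}")
    return problems


current = {"id": "INT64", "email": "STRING"}

# Adding a column keeps existing readers working.
additive = {"id": "INT64", "email": "STRING", "created_at": "TIMESTAMP"}
assert breaking_changes(current, additive) == []

# Dropping a column does not.
destructive = {"id": "INT64"}
assert breaking_changes(current, destructive) == ["column removed: email"]
```

A migration tool built on a check like this would refuse to apply a change whenever the returned list is non-empty.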

6. Run in Production

Execute your transformations in production, leveraging Datoria's partitioning and invalidation system to process only what needs processing.
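The idea of processing only what needs processing can be sketched as follows. This is a simplified model, not Datoria's implementation: it assumes each daily partition carries an integer version that increases whenever upstream data changes, and that the runner records the version it last processed. The function name and data layout are hypothetical.

```python
from datetime import date


def stale_partitions(
    upstream_versions: dict[date, int],
    processed_versions: dict[date, int],
) -> list[date]:
    """Return partitions whose upstream data is newer than the last processed run.

    A partition that has never been processed is treated as stale.
    """
    return sorted(
        partition
        for partition, version in upstream_versions.items()
        if processed_versions.get(partition, -1) < version
    )


upstream = {
    date(2024, 1, 1): 1,
    date(2024, 1, 2): 3,  # upstream data was rewritten after the last run
    date(2024, 1, 3): 1,  # new partition, never processed
}
processed = {
    date(2024, 1, 1): 1,
    date(2024, 1, 2): 2,
}

# Only the changed and the new partition need to be recomputed.
assert stale_partitions(upstream, processed) == [date(2024, 1, 2), date(2024, 1, 3)]
```

Under this model, a run over a large table touches only the invalidated partitions and leaves up-to-date ones alone.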

7. Compare Changes

View the differences between versions of your data platform using Datoria's diffing capabilities.

Tools for Each Stage

Datoria provides specific tools for each stage of the workflow:

| Stage | Primary Tool | Key Capabilities |
| --- | --- | --- |
| Development | VS Code + Datoria LSP | Real-time validation, SQL assistance |
| Unit Testing | UnitTest framework | Test transformations without affecting the database |
| Sandbox Testing | datoria sandbox | Run in isolated environments |
| Dev/Stage | datoria migrate/run --dataset | Deploy to shared environments |
| Migration | datoria migrate | Apply schema changes safely |
| Production | datoria run | Execute transformations efficiently |