Data Engine documentation
This site combines hand-written author guides with generated API reference material.
Contents
- Getting Started
- Core Concepts
- Configuring Flows
- Authoring Flow Modules
- Flow Methods
- Database Methods
- DuckDB Helpers
- Import style
- Shared conventions
- build_dimension(...)
- attach_dimension(...)
- normalize_columns(...)
- denormalize_columns(...)
- replace_rows_by_file(...)
- replace_rows_by_values(...)
- compact_database(...)
- read_rows_by_values(...)
- read_sql(...)
- read_table(...)
- replace_table(...)
- Design guidance
- When to use direct DuckDB instead
- Recipes
- Recipe: Mirror every workbook
- Recipe: Filter rows and write a cleaned output
- Recipe: Capture source metadata during processing
- Recipe: Produce a stable latest snapshot
- Recipe: Read selected worksheets from a multi-sheet workbook
- Recipe: Single-file settings workflow
- Recipe: Batch read with map(...) or step_each(...)
- Recipe: Load into DuckDB and export a summary
- Recipe: Use TOML workspace config
- Recipe: Calculate business days and keep a grouped running total
- Recipe: Offset to the next business due date
- Recipe: Write several outputs for one source
- App Runtime and Workspaces
- The two roots to keep in mind
- How the app is structured
- Authored files vs generated runtime artifacts
- Shared workspace state
- Local state vs workspace state
- Control, handoff, and control requests
- The daemon and the selected workspace
- Workspace selection
- Workspace provisioning
- VS Code provisioning
- Flow-module compilation
- Logging and run history
- The kill switch
- How this affects flow authors
- FlowContext
- API Reference
- Project Map
- Project Inventory