etl-designDeep ETL/ELT design workflow—extract patterns, transforms, loading strategies, idempotency, validation, and reconciliation. Use when designing batch data flo...
Install via ClawdBot CLI:
clawdbot install mike47512/etl-designGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Apr 24, 2026
Migrating on-premise data warehouse to cloud-based Snowflake or Redshift. Use source contract to map schemas, incremental watermark for large tables, and validation checks to ensure data integrity during cutover.
Building a batch pipeline that updates customer profiles nightly from CRM, billing, and support systems. Extract via CDC, apply deterministic transforms for dedup, and load with upsert keys for idempotency.
ETL to load claims data from multiple providers into a central analytics platform. Use full snapshots for small claims tables, versioned transforms for business rules, and quarantine bad rows for audit.
Daily batch reconciliation of inventory between ERP and warehouse management system. Use incremental extract with updated_at watermark, row count validation, and backfill for missing days.
Building a data mart that aggregates clickstream, ad spend, and conversion data. Apply deferred transforms for late-arriving facts, partition by date, and validate with checksums.
Offer a managed ETL service that automates source contracting, incremental extracts, and idempotent loads. Revenue from subscription tiers based on data volume and pipeline complexity.
Provide consulting engagements to design and harden batch pipelines for enterprises, focusing on correctness, validation, and backfill strategies. Revenue from time-and-materials or fixed-price projects.
A SaaS product that validates ETL outputs with row counts, checksums, and referential checks, alerting on breaches. Revenue from per-pipeline or per-organization licensing.
💬 Integration Tip
Pair with data-pipelines orchestration tool and implement dead-letter queues for quarantine rows to simplify troubleshooting.
Scored Apr 19, 2026
Use the @steipete/oracle CLI to bundle a prompt plus the right files and get a second-model review (API or browser) for debugging, refactors, design checks, or cross-validation.
Local search/indexing CLI (BM25 + vectors + rerank) with MCP mode.
Use when designing database schemas, writing migrations, optimizing SQL queries, fixing N+1 problems, creating indexes, setting up PostgreSQL, configuring EF Core, implementing caching, partitioning tables, or any database performance question.
Connect to Supabase for database operations, vector search, and storage. Use for storing data, running SQL queries, similarity search with pgvector, and managing tables. Triggers on requests involving databases, vector stores, embeddings, or Supabase specifically.
Use SQLite correctly with proper concurrency, pragmas, and type handling.
Write correct MySQL queries avoiding common pitfalls with character sets, indexes, and locking.