SI
Stuller, Inc.
- Co-designed a metadata-driven CDC and replication platform (AWS DMS, control-table orchestration, instance autoscaling) feeding 25+ source systems into a 100+ TB Snowflake environment, cutting ~45 minutes from daily extract runs.
- Built near-real-time Oracle EBS replication on a 10-minute cadence, auto-triggering MicroStrategy cube refreshes for live manufacturing, shipping, and fulfillment reporting.
- Engineered SQL Server Change Tracking CDC with MERGE-based upserts, plus Snowflake Streams, Tasks, and dbt snapshots for SCD Type 2 history and audit trails.
- Architected Snowflake FinOps reporting used by executive leadership to forecast and govern $250K in annual warehouse spend.
- Provisioned data platform infrastructure as code — self-managed Airflow on AWS ECS via Terraform and ingestion infrastructure via AWS CDK, with Dockerized build targets for local-to-production parity.
- Re-architected a GA360 BigQuery-to-Snowflake pipeline (~70% lower egress) and migrated ingestion off Fivetran to Airflow ($1,100+/month savings).
- Established Snowflake governance and tagging (~90% compliance) and owned CI/CD and engineering standards across a 1,400+ model dbt codebase.
- Designed and built hundreds of dimensional, fact, and reporting models across sales, inventory, manufacturing/WIP, finance, purchasing, and web/marketing for 10+ teams.
- Established broad dbt test coverage (not-null, uniqueness, relationship, and custom tests) to enforce data integrity and reduce reporting errors.
- Standardized SQL quality with SQLFluff and dbt-Jinja lint rules for consistent, reviewable CI/CD workflows.
- Built a HEX + Google Analytics search-reporting app that cut report generation time by ~50%.
- Built a market basket analysis app that improved campaign targeting and reduced survivorship bias.