We're looking for a Principal AI Data Engineer/Architect to lead the design of our next-generation data and AI platform. Our team ingests petabytes of data across 20+ high-volume, complex SaaS products in finance and e-commerce. We build ETLs, domain-oriented transformations, dbt models and metric definitions, and data quality measures that power customer-facing reporting for financial and tax compliance, plus traffic, growth, and product analytics.
You'll be the technical north star for scalable pipelines into Snowflake, semantic and physical data models, and the AI systems that keep them healthy while reporting to the VP, Data Engineering. You've built AI agents that can automatically maintain data quality, assist with semantic model creation, and accelerate data operations. You're equally comfortable reviewing a DBT PR, tuning a Snowflake workload, designing a domain data contract, or selecting the right LLM strategy (fine-tuning vs. retrieval-augmented) for a tax/financial use case.
#LI-Remote
What Your Responsibilities Will Be
- Lead the end-to-end data/AI architecture for petabyte-scale ingestion, transformation, modeling, and serving across 20+ SaaS products, with Snowflake as the analytical backbone.
- Shape the semantic layer and metrics platform (DBT models, tests, macros, and domain-specific metric definitions) to support customer-facing compliance reporting and internal analytics (traffic, growth, product).
- Build AI agents for data operations that detect, explain, and fix data quality issues; auto-generate/maintain DBT models and documentation; and suggest/validate domain semantic mappings.
- Design data quality at scale using a blend of rules, statistics, and ML (e.g., anomaly detection, drift, outlier scoring), with lineage and observability integrated into orchestration and CI/CD.
- Lead LLM data preparation across financial and tax domains: curate high-quality training corpora, implement secure data pipelines for fine-tuning and retrieval, and enforce governance for PII/tax data.
- Establish domain-driven standards (data contracts, ownership, SLAs/SLOs, data products) and coach teams on best practices for DBT, testing, documentation, and review.
- Optimize for performance and cost (Snowflake compute patterns, clustering/partitioning, caching/materialization strategies) to meet strict latency and concurrency needs.
- Partner with product, compliance, and engineering to translate reporting and regulatory requirements into durable, auditable data models and APIs.
- Mentor senior engineers through design reviews, pairing, and technical roadmap leadership; improve for code quality, testing, and.
- Be able to understand complex data patterns, understand how to convert signals generated from hundreds of data science & data engineering models into executive data stories. Have the to present data stories and stand up to difficult questions.
- Guide platform reliability with orchestration, incident response, lineage/impact analysis, and progressive delivery (feature flags, canaries, backfills).
What You'll Need to be Successful
- 10+ years in data engineering/architecture with hands-on Snowflake and dbt at high scale; deep SQL expertise and Python.
- Experience shipping AI/LLM systems in production, including building agents for data ops/quality and selecting the right patterns (RAG, fine-tuning, PEFT, prompt strategies).
- Track record building domain-oriented semantic layers and metric stores that serve both external customers (financial/tax compliance reporting) and internal analytics at scale.
- Mastery of data quality and observability (tests, profiling, anomaly detection, lineage, SLAs/SLOs) and integrating these into CI/CD and orchestration.
- Background in distributed data processing and streaming (e.g., Spark, Flink, Kafka/Kinesis) and modern orchestrators (Airflow, Dagster, Prefect).
- Experience with ML/MLOps for data quality and data operations (model lifecycle, evaluation, monitoring, drift, and governance).
- Practical knowledge of security, privacy, and compliance for financial/tax data (e.g., SOC 2, ISO 27001, GDPR/CCPA concepts, data masking/row access, management).
Pay Range Details
The base pay range(s) below are provided in compliance with state specific laws. Pay ranges may be different in other locations.
Colorado $199,200-$338,800 (annually)
Washington $199,200-$374,400 (annually)
California $199,200-$410,100 (annually)
NYC $220,200-$410,100 (annually)
The pay range above is the general base pay range for you in the state listed. Your actual salary/wage may be based on several factors, such as geographic location, candidate experience and qualifications, market and business considerations. This role is eligible for an annual bonus based on company performance, depending on the terms of the applicable plan and your role.
This is a remote position.
Avalara is an AI-first Company
AI is embedded in our workflows, decision-making, and products. Success here requires embracing AI as an essential capability.
You’ll bring experience using AI and AI-related technologies, ready to thrive here.
You’ll apply AI every day to business challenges - improving efficiency, contributing solutions, and driving results for your team, our company, and our customers.
You’ll grow with AI by staying curious about new trends and best practices, and by sharing what you learn so others can benefit too.
How We'll Take Care of You
Total Rewards
In addition to a great compensation package, paid time off, and paid parental leave, many Avalara employees are eligible for bonuses.
Health & Wellness
Benefits vary by location but generally include private medical, life, and disability insurance.
Inclusive culture and diversity
Avalara strongly supports diversity, equity, and inclusion, and is committed to integrating them into our business practices and our organizational culture. We also have a total of 8 employee-run resource groups, each with senior leadership and exec sponsorship.
What You Need To Know About Avalara
We’re defining the relationship between tax and tech.
We’ve already built an industry-leading cloud compliance platform, processing over 54 billion customer API calls and over 6.6 million tax returns a year. Our growth is real - we're a billion dollar business - and we’re not slowing down until we’ve achieved our mission - to be part of every transaction in the world.
We’re bright, innovative, and disruptive, like the orange we love to wear. It captures our quirky spirit and optimistic mindset. It shows off the culture we’ve designed, that empowers our people to win. We’ve been different from day one. Join us, and your career will be too.
We’re An Equal Opportunity Employer
Supporting diversity and inclusion is a cornerstone of our company — we don’t want people to fit into our culture, but to enrich it. All qualified candidates will receive consideration for employment without regard to race, color, creed, religion, age, gender, national orientation, disability, sexual orientation, US Veteran status, or any other factor protected by law. If you require any reasonable adjustments during the recruitment process, please let us know.