Lead Analytics Engineer - Data Modeling & Quality
Skills
About the role
Arcadia is dedicated to happier, healthier days for all. We believe that there is a better healthcare world – one powered by data. Our platform transforms complex, diverse data into a unified foundation for health, helping organizations deliver better care, boost revenue, and lower costs.
We’re a team of fiercely driven individuals committed to making healthcare more sustainable—and we’re looking for passionate people to help us get there.
For more information, visit arcadia.io.
Why This Role Is Important to Arcadia
Arcadia's data platform powers population health analytics for health plans, ACOs, and provider groups across the country. As a Lead Analytics Engineer — Data Modeling & Quality, you sit at the intersection of data quality ownership and analytical data modeling. You'll own the SQL and DBT layer that transforms raw clinical and claims data into trusted, production-grade datasets, while also serving as the quality authority for the data those models produce.
This is a hybrid role — deeper SQL and DBT expertise than a traditional Data Health Professional, with a more analytical and model-focused scope than a Data Engineering role. You're less focused on pipeline infrastructure and more on the logic, shape, and trustworthiness of the data itself.
What Success Looks Like
In 3 months
Independently triage and resolve pipeline data quality issues
Author at least one new DBT model or refactor an existing one to meet current modeling standards
Design a DBT test suite for a set of models lacking coverage
Understand the end-to-end pipeline from ingress through silver and gold, and be able to trace a data quality issue to its root layer
In 6 months
Building strong working relationships with clients and cross-functional partners (Data Engineering, Customer Success)
Deeply familiar with Arcadia's full data stack — from ingress through silver, gold, and downstream consumers
Driving at least one improvement project forward, whether technical (e.g. model refactor, new DQ framework) or process-focused (e.g. promotion playbook, triage workflow)
In 12 months
Recognized as a leader within the department — peers and stakeholders seek out your expertise on data modeling and quality
Operating independently across the full scope of the role with minimal guidance
Two or more improvement projects completed and in production, with measurable impact on data quality or operational efficiency
What You'll Be Doing
DATA MODELING & DBT DEVELOPMENT
Author, review, and maintain DBT models using Spark/Hudi from ingest through bronze and silver
Help clients understand their data model, assumptions, and limitations through intentional validation
Troubleshoot and fix issues, then write DBT tests to catch issues proactively
Optimize SQL performance for slow-running jobs
Partner with Data Engineering on Hudi table design, partition strategy, and incremental patterns
DATA QUALITY OWNERSHIP
Triage and classify data quality alerts, distinguishing source-level issues from transform-layer failures
Design and maintain volume monitors and DQ monitors (null rate, distribution, future-date checks)
Author and apply clinical DQ rules (entity volume, field coverage, LOINC coverage, referential integrity) and claims validation rules across silver and gold layers
Conduct quality reviews for connector promotions — evaluating silver entity coverage, validation rule pass rates, and bronze-to-silver transformation correctness
Own the ticket queue for DQ, attribution, hierarchy, and customer-specific data quality issues, writing clear customer-facing findings
CROSS-FUNCTIONAL QUALITY COLLABORATION
Lead data quality reviews during connector installation and promotion (UAT
PRD), including claims validation playbooks and null analysis
Partner with Data Engineering on root-cause triage for errors, ingress anomalies, and silver table issues surfaced through data quality monitoring
Coordinate with the Measure Implementation Team (MIT) when data quality issues affect quality measure scores
Contribute to and enforce data modeling standards across teams
TECHNOLOGIES
Data modeling: DBT-Spark, SQL, Claude
Warehousing: Amazon Redshift, Apache Hudi, AWS Athena
Data quality: volume/DQ monitors, DBT tests
Orchestration: Argo Workflows, Airflow
Source control: Git / GitHub, PR-based review workflows
Observability: Grafana, Loki, Jira
Healthcare data: Claims (plan/professional/pharmacy), EHR (clinical entities), MPI
What You'll Bring
Education:
Bachelor's or Master's degree in Computer Science, Statistics, Business, Economics, or a related field
Experience:
Advanced SQL: window functions, complex CTEs, aggregation patterns, performance tuning on columnar databases
DBT: hands-on experience authoring models, tests, macros, and yml documentation; familiarity with incremental strategies
Healthcare data literacy: working knowledge of claims data (professional, institutional, pharmacy), clinical data (EHR entities), and common quality dimensions (member months, coverage rates, null patterns)
Data quality mindset: ability to differentiate source data issues from transform issues, design systematic validation checks, and communicate data quality findings clearly
Skills:
Clear communicator — able to translate technical findings for clients and non-technical stakeholders
Strong analytical judgment — you can look at a distribution and know when something is wrong
Ability to manage several projects simultaneously, leveraging AI tooling to stay organized and efficient
Genuine desire to learn and apply AI tools for operational efficiency
Would Love For You To Have
Experience with Spark SQL and Hudi table format
Familiarity with data quality monitoring tools
Comfortable operating in an AI-first environment using Claude to build/verify various day-to-day workflows
Exposure to population health analytics concepts: HEDIS measures, risk adjustment, value-based care metrics
Python scripting for data investigation and automation
Experience with Argo Workflows or similar orchestration platforms
Healthcare data standards: ICD-10, CPT, NDC, LOINC, NPI
What You'll Get
Work alongside a talented team on some of the most complex and rewarding challenges in healthcare data
Flexible, fully remote work environment with the resources and support to do your best work
Exposure to senior leaders
Be on the front lines of AI adoption — use cutting-edge tools to accelerate your work and shape how the team operates in an AI-first environment
Make a meaningful impact on healthcare data operations by improving the quality, reliability, and trustworthiness of data that drives patient care decisions
Be a part of a mission driven company that is transforming the healthcare industry
Become a member of the talented, energized, diverse and purpose-driven Arcadian Community
$160,000 - $185,000 a year
About Arcadia
Arcadia.io helps innovative providers and payers across the country transform healthcare to reduce cost while improving patient health. We do this by aggregating large amounts of disparate data, applying algorithms to identify opportunities to provide better patient care, and making those opportunities actionable by physicians at the point of care in near-real time. We are passionate about helping our customers drive meaningful outcomes. We are growing fast and have emerged as a market leader in the highly competitive population health management software market and have been recognized by industry analysts KLAS, IDC, Forrester, and Chilmark for our leadership. For a better sense of our brand and products, please explore our website.
Protect Yourself
If you have concerns about the authenticity of a job offer or recruitment-related communication claiming to be from Arcadia, we encourage you to verify by contacting us directly at (781) 202-3600 and select option 3. For more information, visit our website.
This position is responsible for following all Security policies and procedures in order to protect all PHI under Arcadia's custodianship as well as Arcadia Intellectual Properties. For any security-specific roles, the responsibilities would be further defined by the hiring manager.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Questions about this role
How do I apply to this Lead Analytics Engineer - Data Modeling & Quality role at Arcadia.io?
Click "Apply with AI Applyd" above. We auto-fill the application from your resume and answer screening questions in seconds. No copy and paste, no juggling tabs.
What's the typical salary for Analytics Engineer in your country?
Compensation varies by seniority, employer size, and location. When this listing publishes a salary band you'll see it in the badge row above the description.
How fast does AI Applyd auto-apply?
Most applications complete in under 90 seconds. You can track the status in your dashboard and watch the screenshot proof land the moment the application submits.
What ATS does Arcadia.io use?
AI Applyd supports Greenhouse, Lever, Ashby, Workday, iCIMS, SmartRecruiters, LinkedIn Easy Apply, and most other ATS platforms. If we can submit through the platform, we do.
Want AI Applyd to auto-apply to roles like this?
We tailor your resume per posting, fill the forms, and track replies for you.