Georgia Tech
Senior Data Engineer
Job Description
About the role

As Vanna continues to grow, we’re looking for a Senior Data Engineer to build and mature the data platform capabilities that are foundational to Vanna’s success across several key domains, including population evaluation and selection, member outreach and engagement, and optimization of clinical and financial outcomes for both individual members and our populations as a whole. You’ll work closely with our engineering, product, operations, and clinical teams to develop the technology that enables our community-based teams to provide compassionate care to our members living with severe mental illness. This is a unique opportunity to join a small but growing team, where you’ll develop many of our foundational data capabilities from scratch and own projects end-to-end.

You have a growth mindset and are excited about expanding your technical skills and stepping into new responsibilities as the team and company grow.

What you’ll work on

- Design, build, and operate cloud-based data infrastructure, optimizing for scale, performance, and cost efficiency as the size of our data grows
- Automate data flows, and layer in monitoring, alerting, and data quality tests to ensure data is accurate and up-to-date
- Build interfaces that expose data and associated processing functionality directly to end users and other systems
- Design and implement production-grade ML/AI infrastructure, and partner with data scientists to take models into production
- Build frameworks and internal self-service tooling that help the data team deliver value faster

About you

- 6+ years of experience in data, ML, or back-end software engineering
- Advanced Python skills
- Databricks and Spark experience strongly preferred
- Experience with analytical data stores (e.g. Delta Lake, Snowflake, BigQuery) and relational databases (e.g. Postgres, MySQL)
- Intermediate to advanced SQL knowledge (experience with dbt a big plus)
- Experience building and operating production-grade ML/AI systems
- Familiarity with CI/CD
- Obsessive focus on end users and business impact

Bonus points if you have:

- Familiarity with healthcare data
- Experience with (or a desire to learn) infrastructure as code tools like Terraform or Databricks Asset Bundles
- Experience working in highly regulated environments
- Experience at an early-stage startup
- Familiarity with streaming and event-driven architectures