DNAnexus
Principal Software Engineer (Machine Learning)
Job Description
Company Description
DNAnexus is a leading provider of secure, scalable, and intuitive biomedical data analysis software and bioinformatics applications for the life sciences and healthcare communities. The company actively manages and supports more than 80 petabytes of complex genomic, multi-omic, and clinical datasets on behalf of a growing network of collaborations with large-scale biobanks, as well as leading pharmaceutical, clinical diagnostic, academic research, and government organizations. Over 40,000 scientists across 48 countries are now using the highly collaborative, cloud-based, end-to-end platform to gain data-driven insights that can advance scientific discovery, accelerate precision medicine, and improve patient care.
Job Description
Who We’re Looking For
- You are an experienced Machine Learning Engineer who is passionate about creating and delivering cutting-edge enterprise-class AI-powered solutions.
- You will provide technical leadership to a small team being set up from scratch, focused on building industry-leading AI capabilities for our suite of products.
- You have designed and released into production, multiple ML-based solutions that solve real business problems using technologies such as deep learning, machine learning or generative AI.
- You can work autonomously without a lot of supervision, think independently and dream up practical, out-of-the-box ideas.
- This is a key position in Engineering. You and your team will provide AI thought leadership not just within Engineering, but more broadly across the company.
Responsibilities
- Design, build and productionalize robust, high-performing ML, LLM and other AI solutions per the roadmap, and integrate them into our existing product suites.
- Build the supporting services to operationalize and automate your solution including unit and integration tests, CI/CD tooling, monitoring capabilities and data pipelines.
- Collaborate closely with other Engineering teams, Product Mgmt and our CTO to shape our AI roadmap and build exciting new solutions for our customers.
- Stay current with the latest developments in AI and evangelize them within the company.
- Provide technical leadership to other engineers in your team
Qualifications
- 8+ years of software product development experience. At least 5 years of deep, intensive ML experience in a product setting, rather than in a research-oriented environment.
- A strong technical foundation with a Computer Science background, ideally with a Masters degree specializing in AI.
- You are fluent in Python and the Data Science stack
- You have experience with ML technologies and frameworks and extensive experience with at least a couple of the following: Pytorch, Tensorflow/Keras, scikit, large-scale data processing technologies (SQL, data warehouses, Spark or HDFS).
- You have Generative AI experience deploying LLMs, RAG pipelines and agents eg. GPT4, LangChain, LLama2, Mixtral or similar technologies.
- You are comfortable designing and implementing the end-to-end ML workflow to deploy your ML models to production, and monitoring them post-production. This might include writing data pipelines and other Data Engineering or ML Ops tasks.
- Are extremely customer-focused. You have built products that have moved the needle for your customers.
- Working in a distributed, remote-first environment with asynchronous communication comes naturally to you.
- Familiarity with Javascript and node.js would be an advantage but not essential.
- Experience with designing and building scalable distributed systems; familiarity with AWS (or Azure / GCP) would be an advantage
What We Care About
- Curiosity and humility. We are dedicated to learning and constantly improving. We hope you also value “unknown unknowns” and have a natural drive to figure out how everything works.
- Agility and ownership. Our teams have a high degree of freedom on how they build their systems, but they also take full ownership of what they deliver, in the long run.
- Proactivity. An internal drive to leave things in a better state than they are. Going above and beyond to help our codebase to stay healthy.
- Monitoring and automation. We strongly believe that a fully automated system is key to rapid iterations, while an extensive set of metrics highlights what needs to be done.
- Balancing research and product. We value staying up-to-date on cutting edge research, while also shipping meaningful products enabling researchers to fulfill their mission.
Additional Information
Headquartered in Mountain View, California, with over 220 team members across the United States and Europe, DNAnexus is experiencing rapid growth and market adoption. With the support of leading investors including Google Ventures and Blackstone, and trusted by hundreds of the world’s biomedical leaders, the company is at the innovative forefront with our precision health data cloud to drive scientific breakthroughs. If you are interested in joining our team, please apply today!
DNAnexus will provide the necessary support to enable the ideal candidate to work remotely within the Czech Republic.