MLOps Specialist

28 May 2024
£37000 - £70000 / year

Job Description

Date: May 2024

Role: MLOps Specialist

No. of Positions: 1

Description:

The Scientific Computing area at Airbus provides engineering with state-of-the-art high-performance computing platforms and services and is a key enabler of aircraft development today and tomorrow. Our multifunctional, multidisciplinary team tackles the future challenges of transforming the scientific computing ecosystem and works in an Agile/SAFe context.

We are seeking to strengthen our team with an MLOps specialist focused on Artificial Intelligence and Machine Learning, who will support the sizing and set-up of a future MLOps service for Scientific Computing.

The key challenges of this position revolve around the technological evolution and transformation of our platforms and services for the Scientific Computing of the Future framework, specifically in the area of future High Performance Computing.

This position will bring you numerous exciting challenges. You will work with vibrant and diverse teams of IT professionals and engineers, develop your skills through extensive dedicated training programs, have opportunities to travel, and be empowered to make a difference!

Qualification & Experience

  • Engineering graduate with 5-7 years of experience in engineering software applications (design, development, infrastructure set-up, support, etc.).

  • Strong understanding of software development, design concepts and principles.

  • Proven track record of building and maintaining applications at scale for end to end implementation.

  • Strong knowledge of the set-up and operation of MLOps infrastructure and services, and the skills needed to perform the following tasks:

    • Install and test containerized solutions based on appropriate tool-sets

    • Set up an environment for proper orchestration and scheduling of jobs, ML experiments, training runs, etc.

    • Test, provide and integrate tools relevant for AI/ML services such as Docker, JupyterHub, Jenkins, Elastic, Spark, Scala, Kafka, Artifactory, Grafana

    • Investigate virtualization software for clusters (e.g. Bright)

  • Good Linux skills, preferably on Red Hat 7.x, 8.x, and 9.x, CentOS, and Ubuntu

  • Advanced knowledge of different AWS Services

  • Experience migrating complex solutions to AWS infrastructure

  • Experience designing cloud and on-premise solution architectures.

  • Strong knowledge of cloud-by-design principles and cybersecurity is a must.

  • Deep understanding of coupled, decoupled, and loosely coupled architectures.

  • Experience with Continuous Integration and Continuous Deployment processes and tools (e.g. Git, Jenkins, Ansible, etc.)

  • Experience in setting up hybrid on-prem/cloud solutions, including cloud-enabled schedulers and MLOps environments

  • Knowledge of working and integrating with Large Language Models (LLMs).

  • Knowledge of GCP Gemini is a plus.

  • Experience in REST API integration within multi-cloud and hybrid environments (AWS SDK, GCP APIs, etc.).

  • Experience in building solutions for ML workloads with AWS SageMaker services is a must.

  • Working experience with data platforms (such as Databricks, Snowflake, or Palantir) is a must.

  • Experience in Model Development, model lifecycle management and deployment is a must.

  • Experience with Scientific Compute platforms architecture (HPC, scientific computing workspaces) is desirable

  • Strong willingness to engage internal AI/ML customers and stakeholders to:

    • Gather input and feedback for the configuration of the environment

    • Guide and support customers in the usage of the environment

  • Knowledge of Agile/SAFe principles and Service Management best practices.

  • Experience in working with VersionOne, Jira, or equivalent tools for Agile projects

  • Experience in working with ServiceNow or other equivalent tools for incident, problem and change management

  • Strong willingness to engage and steer cloud service providers to enable hybrid on-prem/cloud AI/ML services

  • Strong willingness to travel to Europe for both short and long periods

Responsibilities:

  • Configuration of an AI/ML service on a dedicated computation node or as part of a bigger HPC system to enable internal customers

  • Creation of an MLOps environment to develop concepts for industrialization and maintenance of ML models

  • Development of an AI service concept for a future HPC set-up

  • Development of hybrid AI on-prem/cloud services

  • Install, configure, tune, and monitor Linux-based applications (COTS, open source, and Business Owned Tools (BOTs)) for optimum performance in large-scale, transnational virtual and physical environments.

  • Engage internal customers to utilize the provided resources and to gather feedback and input to improve the MLOps environment.

  • Participate in daily scrum meetings of agile and operational teams.

  • Act as an intercultural “bridge” between Europe and India (Being at ease in interacting and exchanging with French, German or Spanish engineering teams is important.)

  • Participate in PI planning workshops every 10-12 weeks.

  • Be willing to work European shift times (up to 10:30 PM IST in summer and 11:30 PM IST in winter).

Other responsibilities may include:

  • Perform the software integration following Airbus internal “common installation rules” through the CDTNG layer on the codeshare filesystems 

  • Perform scaling tests of integrated tools on single cores, multiple cores, and multiple nodes to help engineers use the software as efficiently as possible

  • Resolve Level 3 incidents and perform problem analysis on IS/IT level caused by scientific computing applications

  • Create test repositories for any new tool development

  • Design, build and run automated tests 

  • Test and deploy the scientific computing software after patch deployments. 

  • Apply DevOps tools, culture, and mindset to all your daily activities

  • Work with engineers to raise the quality of their requests to the required level

  • Collaborate closely with business workflow designers on implementation and integration

  • Participate in escalation meetings when needed to find root causes in a large environment

  • Deliver Level 3 support to engineering end users

Success Metrics:

Success will be measured in a variety of areas, including but not limited to:

  • Agile mind-set, collaborative way of working, quick reaction in case of operational issues, SLA fulfillment & service availability

  • Consistently ensure the on-time delivery and quality of the projects

  • Bring innovative, cost-effective solutions.

  • Achieve customer satisfaction.

  • Ability to handle a subject end to end, from demand management through development, integration, maintenance, and support.

This job requires an awareness of any potential compliance risks and a commitment to act with integrity, as the foundation for the Company’s success, reputation and sustainable growth.

Company:

Airbus India Private Limited

Employment Type:

Permanent

Experience Level:

Professional

Job Family:

Digital

By submitting your CV or application you are consenting to Airbus using and storing information about you for monitoring purposes relating to your application or future employment. This information will only be used by Airbus.
Airbus is committed to achieving workforce diversity and creating an inclusive working environment. We welcome all applications irrespective of social and cultural background, age, gender, disability, sexual orientation or religious belief.

Airbus is, and always has been, committed to equal opportunities for all. As such, we will never ask for any type of monetary exchange in the frame of a recruitment process. Any impersonation of Airbus to do so should be reported to [email protected].

At Airbus, we support you to work, connect and collaborate more easily and flexibly. Wherever possible, we foster flexible working arrangements to stimulate innovative thinking.