Job Description
Work experience
o4+ years of development experience building and maintaining ETL /ELT pipelines that operate on a variety of sources, such as APIs, FTP sites, cloud-based blob stores, databases (relational and non-relational)oExperience working with operational programming tasks, such as version control, CI/CD, testing and quality assuranceoExperience with Apache data projects (Hadoop, Spark, Hive, Airflow), or cloud platform equivalents (Databricks, Azure Data Lake Services, Azure Data Factory) and in one or more of the following programming languages: Python, Scala, R, Java, Golang, Kotlin, C, or C++)oExperience with SDLC methodologies, particularly Agile and project management tools, preferably Azure DevOpsoDesign, develop, and implement a high-performance Python coding with a focus on efficiency, scalability, and performance.oExpert level on using standard ML libraries such as Numpy and PandasoAdvanced level on using ML frameworks PyTorch / TensorFlow on Azure cloud ecosystemoCollaborate closely with data scientists, machine learning engineers, and other stakeholders to understand their requirements and translate these into data-driven solutionsoTroubleshoot, debug, and resolve any issues that occur within the generative AI system development, ensuring that the models perform optimally.oDocument all processes, specifications, and training procedures, and maintain stringent version control to ensure the high quality, accuracy, and replicability of generative models.oPBI Fabric, DAX, Power Queries
Educationalqualifications
oBachelor’s in computer science engineering or equivalent orrelevant experience
oCertification in cloud technologies especially Azure, would be goodto have