BOUSSIAS
Junior Data Scientist & Engineer role
Job Description
The data scientist and engineer role reports to the Chief Data Officer (CDO) and is part of the newly established Data Team. This is a new team that aims to organize, manage and leverage all data that BOUSSIAS collects, in order to support both strategic and tactical decision making. As a key member of our Data Team, you’ll play a crucial role in transforming our data capabilities and driving data-informed decision-making across the organization. You’ll also have the opportunity to work on diverse projects, continuously learn new technologies, and help shape the data strategy of a leading company. You’ll collaborate closely with cross- functional teams including marketing, sales, and product development to deliver data- driven insights and solutions. This role offers significant potential for growth and advancement as our data capabilities expand.
Responsibilities
· Data pipeline development and optimization: Build ETL/ELT data pipelines from multiple sources and continuously optimize them for improved performance and efficiency.
· Data governance: Contribute to establishing and maintaining data governance policies and procedures to ensure data quality, security, and compliance.
· Data quality: In particular, develop algorithms to identify data quality issues and fix them, and automate all QC processes.
· Prescriptive modeling: Build and deploy ML models to specify the next best actions (NBAs) per client and lead.
· AI pipeline development and automation: Design and implement RAG (Retrieval- Augmented Generation) frameworks to analyze internal content and online information for identifying sales opportunities. Create and maintain robust MLOps pipelines for efficient model deployment and management.
· Monitoring and alerting: Implement monitoring and alerting systems for data pipelines and models to ensure reliability and performance.
· Exploratory data analysis: Conduct exploratory data analysis to uncover actionable insights and identify opportunities for business improvement.
· BI development: Design BI dashboards that effectively communicate insights to various audiences. Build and automate report production for business users.
· Stakeholder collaboration: Work closely with business stakeholders to understand their data needs and translate them into technical requirements.
· A/B testing: Implement A/B tests to optimize business processes and decision- making.
· Documentation and knowledge sharing: Create and maintain comprehensive documentation of data processes, models, and insights for knowledge sharing across the organization.
Requirements
· 1 -3 years of professional experience in data science/engineering roles.
· BSc and MSc or other postgraduate degree in computer science, statistics, or related technical field.
· Demonstrated experience delivering end-to-end data science/engineering projects.
Data Skills:
· Strong proficiency in Python for data analysis and machine learning
· Experience with data analysis libraries like pandas, NumPy, and scikit-learn
· Expertise in building and deploying machine learning and AI models
· Knowledge of deep learning frameworks such as TensorFlow or PyTorch
· Experience with big data technologies such as Spark
· Proficiency in SQL and working with relational databases
· Experience building ETL pipelines on cloud platforms (GCP and/or AWS)
· Familiarity with cloud data warehouses like BigQuery or Redshift
· Experience with data lakes (e.g. S3)
· Familiarity with data visualization and BI platforms such as PowerBI, Looker, or Tableau
· Experience with version control using Git
Cloud skills:
· Hands-on experience with GCP and/or AWS services for data and ML workflows
· Familiarity with serverless computing (e.g. AWS Lambda, Google Cloud Functions)
Soft skills:
· Strong analytical and problem-solving abilities
· Excellent communication skills to present findings to technical and non-technical audiences
· Ability to work independently and as part of a team
· Solution oriented
· Attention to detail
· Growth mindset and eagerness to learn new technologies
Additional desired :
· Experience designing and implementing ETL processes to sync data between Salesforce and other systems
· Experience with containerization and orchestration (Docker, Kubernetes)
· Knowledge of data streaming technologies (e.g. Kafka)
· Familiarity with NoSQL databases
· Understanding of data privacy and security best practices
Benefits
What’s in it for you:
· Work in a fast-paced environment at a leading, trend-setting company with pioneer products.
· Competitive remuneration, based on your skills and experience.
· Ongoing learning opportunities and free participation in BOUSSIAS events and conferences