Orbis Operations

Junior Data Scientist

11 October 2024
Apply Now
Deadline date:
£50000 - £100000 / year

Job Description

Orbis Operations is seeking a motivated and detail-oriented Junior Data Scientist to support our growing team in the collection, transformation, and analysis of open source data. In this role, you will be responsible for building data pipelines, cleaning and organizing datasets, and providing data-driven insights to support government missions. While experience with AI and Large Language Models (LLMs) is a plus, this role focuses primarily on open source data collection and processing, offering an excellent opportunity for career growth in the data science field.

Key Responsibilities

Duties/Responsibilities

  • Assist in the collection, preprocessing, and transformation of large-scale open source datasets from various structured and unstructured sources.
  • Develop and maintain data pipelines to ensure efficient data flow and integration across different platforms.
  • Perform exploratory data analysis (EDA) to uncover patterns and insights in open source data that align with mission objectives.
  • Collaborate with data engineers, software developers, and mission stakeholders to understand data requirements and deliver tailored solutions.
  • Write clear, well-documented code to automate data processing tasks and streamline workflows.
  • Generate reports and visualizations that communicate findings to both technical and non-technical stakeholders.
  • Stay up to date with trends in open-source data tools, technologies, and methodologies to continuously improve data collection and pipelining processes.

Supervisory Responsibilities

This position has no supervisory responsibilities

Required Qualifications:

  • Bachelor’s degree in Computer Science, Data Science, Statistics, or related field, or equivalent work experience.
  • 1-3 years of experience working in data science, with a focus on open source data collection, cleaning, and pipelining.
  • Strong programming skills in Python, with familiarity in libraries such as Pandas, NumPy, and requests for data manipulation and processing.
  • Experience working with APIs and web scraping tools to collect and integrate data from open source platforms.
  • Understanding of data cleaning, transformation, and storage best practices.
  • Strong problem-solving skills, with an ability to manage multiple tasks and projects simultaneously.
  • Excellent communication skills, with the ability to work effectively in a team and present findings to non-technical audiences.
  • ACTIVE Security Clearance TS SCI w POLY

Preferred Qualifications

  • Experience working with cloud platforms (e.g., AWS, Google Cloud, Azure) for data storage and processing.
  • Familiarity with SQL or NoSQL databases for querying and managing datasets.
  • Knowledge of data visualization tools such as Matplotlib, Seaborn, or Power BI.
  • Experience with natural language processing (NLP) and Large Language Models (LLMs) is a plus, but not required.
  • Prior experience working on government or public sector projects.

Physical Requirements

  • Prolonged periods of sitting at a desk and working on a computer.
  • Routine video conference and/or in-person meetings.
  • Ability to attend planned meetings within the Washington Metro Area region.
We are an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender identity, sexual orientation, national origin, disability, or protected veteran status.

Location

McLean, VA


EWJP2