St. Jude Children’s Research Hospital

Data Scientist – BMTCT

31 October 2024
Apply Now
Deadline date:
£86000 - £154000 / year

Job Description

The Data Scientist role will have a direct impact on analytics needs of the entire institution, utilizing state-of-the-art technologies. This position will be responsible for using data to unlock trends and key insights that are critical for the success of St. Jude operations. This role will be expected to gain significant domain knowledge across administrative, clinical and basic science areas and help them make effective data based decisions. Will need to assist senior data scientists in developing presentations, charts and graphs to effectively communicate and illustrate results of key-findings, data analysis and data/science models as well as present them to customers including executive leadership teams. Perform all data pre-processing steps including but not limited to collecting data in collaboration with Data Engineers, data cleansing, normalization, feature selection and dimensionality reduction. Develop machine learning models or use stochastic models. Document all technical and business aspects associated with each project.

Bone Marrow Transplant and Cell Therapy (BMTCT) Program is actively seeking a versatile Data Scientist with a distinctive skill set that merges proficiency in data engineering, advanced data analytics, and statistical modeling. In this multifaceted role, you will play a pivotal role in harnessing data to optimize patient outcomes, enhance operational efficiency, and contribute to cutting-edge advancements in BMTCT. The Data Scientist role will have a direct impact on analytics needs of the entire Program, utilizing state-of-the-art technologies. While primarily focused on Bone Marrow Transplant and Cell Therapy (BMTCT), the Data Scientist will also collaborate extensively with departments such as Oncology, Survivorship Care Plan, Epidemiology (EPI), Infectious Diseases, Hematology, and the Center of Excellence for Pediatric Immuno-Oncology (CEPIO) to unlock the full potential of data analytics in healthcare. This position will also be responsible for using data to unlock trends and key insights that are critical for the success of BMTCT operations. This role will be expected to gain significant domain knowledge across administrative, clinical and basic science areas and help them make effective data-based decisions. Will need to assist Principal Investigators in developing presentations, charts and graphs to effectively communicate and illustrate results of key-findings, data analysis and data/science models as well as present them to customers. Perform all data pre-processing steps including but not limited to collecting data in collaboration with St Jude Data Engineers, data cleansing, normalization, feature selection and dimensionality reduction. Develop machine learning models or use stochastic models. Document all technical and business aspects associated with each project. This position requires a blend of Advanced analytical expertise, Data Engineering concepts, and a keen understanding of business objectives.

Job Responsibilities:

Data Analytics:

  • Conduct exploratory and advanced data analysis to extract actionable insights.
  • Develop and implement statistical models for predictive and prescriptive analytics.
  • Create visualizations to communicate complex findings effectively.
  • Utilize advanced statistical techniques and machine learning algorithms to analyze large datasets related to oncology, epidemiology, infectious diseases, and pediatric immuno-oncology.

Data Engineering:

  • Design, build, and optimize end-to-end data architectures and pipelines.
  • Implement scalable ETL (Extract, Transform, Load) processes for data integration.
  • Utilize programming languages such as Python, Java, or Scala for data engineering tasks.

Data Collection and Integration:

  • Collect, clean, and preprocess data from various sources, ensuring data quality and reliability.
  • Integrate and consolidate disparate data sets for comprehensive analysis.

Exploratory Data Analysis (EDA):

  • Conduct exploratory data analysis to understand the patterns, trends, and distributions within the data.
  • Use statistical techniques to identify outliers and key features in the dataset.

Data Visualization:

  • Create visually compelling and informative dashboards and reports to communicate complex findings to stakeholders.

Machine Learning Model Development:

  • Design, build, and optimize machine learning models for various applications.
  • Select appropriate algorithms and techniques based on the nature of the problem.

Model Evaluation and Validation:

  • Evaluate model performance using appropriate metrics and validation techniques.
  • Fine-tune models to improve accuracy, precision, recall, or other relevant metrics.

Predictive Analytics:

  • Apply predictive analytics to forecast trends, behaviors, or outcomes based on historical data.
  • Implement models that support decision-making processes.

Collaboration:

  • Work closely with data scientists, business analysts, and other team members to understand data requirements and contribute to data-driven decision-making.
  • Collaborate with IT teams to ensure seamless integration between data engineering and analytics processes.
  • Act as a key liaison between the Oncology, Epidemiology (EPI), Infectious Diseases, and the Center of Excellence for Pediatric Immuno-Oncology (CEPIO).
  • Foster collaboration and knowledge exchange by facilitating cross-departmental meetings, workshops, and data-sharing initiatives.

Data Quality Assurance:

  • Implement data quality checks to ensure accuracy and reliability of analytics results.
  • Establish and enforce data governance practices to maintain data quality standards.

Big Data Technologies:

  • Utilize big data technologies such as Hadoop, Spark, and Kafka for large-scale data processing.
  • Optimize data processing and storage for performance and scalability.

Continuous Learning:

  • Stay updated on industry best practices, emerging trends, and advancements in both data analytics and data engineering.
  • Engage in continuous learning to enhance skills and stay at the forefront of technologies.

Maintains working knowledge of all IS management methodologies and ensures staff development in this area:

  • Understands and is able to apply all IS management methodologies (software, policies/ procedures, etc.) on a daily basis, as applicable.
  • Actively pursues new opportunities to enhance IS management methodologies.
  • Ensures staff adheres to standards.
  • Identifies opportunities for staff to enhance their skills and knowledge to support IS management methodologies.

Builds software components in accordance with the relevant requirements, system and software architectures, design, and coding standards

Analyze design specifications, documentation, and requirements surrounding the data science technology components

Maintains regular and predictable attendance

Performs other related duties as assigned in order to meet the goals of the department and institution

Minimum Education and/or Training:

  • Master’s degree in Statistics/Mathematics/Business Analytics or related fields is required

Minimum Experience:

  • Four (4) years of experience in data analytics/reporting required, with at least two (2) of those years of experience in exclusive data science role. Two (2) years in a data scientist role required with a doctoral degree (PhD). • Clinical/Health Care domain experience preferred • Must possess implementation experience of data science solutions using R/Python/Scala, Matlab, RapidMiner, SAS, Spark, Excel or related technologies

Licensure, Registration and/or Certification Required

  • R/Python/Data Scientist/ Azure/ SQL or other related certifications Preferred
  • EPIC Caboodle certification is Preferred or must be obtained within one year of employment.

Special Skills, Knowledge and Abilities:

  • Self-motivated independent and possess the ability to learn quickly
  • Must be a team player with ability to function as part of a team that will develop the EDW solution and work with different functional teams
  • Must possess multi-tasking/time management skills with the ability to operate under minimal guidance
  •  Problem-Solving Intellect, particularly challenging real-world problems
  •  Must possess excellent communication and presentation skills
  •  Durable Passion to apply data science to solve diverse problems
  •  Hypothesis-driven Research (make and test hypothesis; implement experiments)
  •  Statistical Methods (e.g., correlations and regressions, ANOVA, resampling, effect size)
  •  Machine-learning Techniques (e.g., clustering, decision trees, nearest-neighbors, support vector classifiers, ensemble methods, collaborative filtering)
  • Must have experience in data presentation and reporting tools like Tableau, Spotfire, Qlik, or Power BI
  • Must have experience working with unstructured data sets to unlock business value and insights
  • Experience developing NLP tools and State Machines using Apache Solr
  • Must possess advanced SQL skills and working knowledge of Microsoft SQL Server/Oracle/Tera Data or Spark
  • Experience with end-to-end machine learning process

Compensation

In recognition of certain U.S. state and municipal pay transparency laws, St. Jude is including a reasonable estimate of the compensation range for this role. This is an estimate offered in good faith and a specific salary offer takes into account factors that are considered in making compensation decisions including but not limited to skill sets, experience and training, licensure and certifications, and other business and organizational needs. It is not typical for an individual to be hired at or near the top of the salary range and compensation decisions are dependent on the facts and circumstances of each case. A reasonable estimate of the current salary range is $86,320 – $154,960 per year for the role of Data Scientist – BMTCT.

Explore our exceptional benefits!

Diversity, Equity and Inclusion

St. Jude Children’s Research Hospital has a diverse, global patient population and workforce, built on the principles of diversity, equity and inclusion. Our founder Danny Thomas envisioned a hospital that would treat children of the world—regardless of race, religion or a family’s ability to pay. Learn more about our history and commitment.

Today, we continue the mission to advance cures and means of prevention for pediatric catastrophic diseases through research and treatment. As we accelerate this progress globally, we believe our legacy of diversity, equity and inclusion is foundational to success. With the commitment of leaders at all levels of the organization, we strive to ensure the St. Jude culture, leadership approaches and talent processes are equitable and culturally responsive. View our Diversity, Equity and Inclusion Report to learn about the hospital’s roots in diversity, equity and inclusion, where we are today and our aspirations for an even better future.

St. Jude is an Equal Opportunity Employer

No Search Firms

St. Jude Children’s Research Hospital does not accept unsolicited assistance from search firms for employment opportunities. Please do not call or email. All resumes submitted by search firms to any employee or other representative at St. Jude via email, the internet or in any form and/or method without a valid written search agreement in place and approved by HR will result in no fee being paid in the event the candidate is hired by St. Jude.