Senior Data Scientist
Listed on 2026-03-12
IT/Tech
AI Engineer, Machine Learning/ ML Engineer, Data Scientist, Data Analyst
Senior Data Scientist I
Are you interested in working with data and analytics to solve problems?
Are you interested in bringing your Gen AI, Machine Learning and NLP expertise to projects?
About our Team: The Data Science Health team uses Generative AI, Agentic AI, Machine Learning, Natural Language Processing, and statistical techniques to build state-of-the-art applications for the health sciences domain.
About the Role: As a Senior Data Scientist, you will play a pivotal role in the development and deployment of cutting-edge Generative AI models and solutions. You will be responsible for building, testing, and maintaining our Generative AI, Retrieval Augmented Generation (RAG), Agentic AI and Natural Language Processing (NLP) solutions. This includes evaluating their performance and implementing guardrails to ensure ethical and responsible use of AI technologies.
You will engage in the entire life cycle of data science projects, including design, implementation, evaluation, productionisation and ongoing enhancement. A key focus of your work will be on the customization and optimization of existing RAG pipelines to support applications that involve content ingestion, machine translation, and contextualized information retrieval.
Experience with end-to-end model deployment is a strong plus, including the use of AI agents, the Model Context Protocol (MCP) for effective context management, and cloud platforms such as AWS (including AWS Bedrock), Azure, or similar services. Your deliverables will include efficient, production-ready Python code; experience in Java is considered an asset. You will collaborate closely with Subject Matter Experts (SMEs) and the technology team to deploy and operationalize our data science pipelines.
This role requires a strong foundation in Natural Language Processing (NLP), Machine Learning, Transformer models and Generative AI, as well as proficiency in Python.
- Collect data, perform data analysis, develop models, define quality metrics, and conduct quality assessments of models, along with regular presentations to stakeholders.
- Create production-ready Python packages for each component of data science pipelines (e.g., pre-processing, model inference, evaluation) and coordinate their deployment with the technology team.
- Design, develop, and deploy Generative AI models and solutions that meet specific business needs.
- Expertise in Retrieval Augmented Generation (RAG) optimization and customization of existing RAG pipelines to meet specific project needs.
- Proficiency in large-scale data ingestion, preprocessing, and transformation of multilingual content to ensure high-quality inputs for downstream models.
- Experience building Agentic RAG systems is a strong requirement.
- Experience with LangChain, AutoGen, Haystack, MCP or similar AI agent management tools.
- Fine-tune large language models (LLMs) and transformer models to enhance accuracy and relevance.
- Implement guardrails and evaluation mechanisms to ensure responsible and ethical AI usage.
- Conduct rigorous testing and evaluation of AI models to ensure high performance and reliability.
- Integrate data science components and ensure end-to-end quality assessment.
- Maintain the robustness of data science pipelines against model drift and ensure consistent output quality.
- Establish a reporting process for pipeline performance and develop automatic re-training strategies for existing pipelines.
- Work collaboratively with cross-functional teams to integrate AI solutions into existing products and services.
- Mentor junior data scientists and contribute to the knowledge-sharing culture within the team.
- Stay up-to-date with the latest advancements in AI, machine learning, and NLP technologies.
- Master’s or Ph.D. in Computer Science, Data Science, Artificial Intelligence, or a related field.
- 7+ years of relevant applied experience in data science, with a focus on Generative AI, NLP, and machine learning.
- Proficiency in Python for data analysis, model development, and deployment.
- Strong experience with transformer models and fine-tuning techniques for large language models (LLMs).
- Proficiency in Generative AI…