Senior AI Engineer
Listed on 2026-02-28
IT/Tech
AI Engineer, Data Engineer
Company Description
Global fintech company building large-scale AI infrastructure for financial systems.
Job Description
Join a market-leading fintech company building and scaling large-scale foundation models to enhance AI capabilities across financial products. You’ll work on pre-training, architecture design, and optimisation of advanced models, contributing directly to research and production systems.
This is a research-to-production AI engineering role focused on scalable model training and performance.
The Opportunity
You’ll work with distributed GPU clusters at significant scale, collaborating with global engineers to push forward foundation model performance and efficiency. The role offers autonomy over experimentation, architecture decisions, and the transition of research into deployable systems.
This is an opportunity to shape how large-scale AI models are built and applied within financial infrastructure.
What you will do
- Lead end-to-end pre-training efforts, including curation of datasets, training runs, monitoring metrics, and post-run analysis to ensure optimal model performance.
- Design and prototype cutting-edge model architectures, focusing on both transformer and non-transformer approaches to enhance intelligence and efficiency.
- Optimize computational performance through techniques like profiling, kernel-level optimization, mixed precision, and effective distributed training strategies.
- Drive reproducible research workflows with careful experiment tracking, translating findings into improved baseline models and production-ready training methodologies.
- Collaborate with infrastructure teams to refine training systems and ensure they scale across target platforms, improving end-to-end training efficiency.
- Contribute to internal knowledge-sharing and external publications by sharing research outcomes when appropriate.
- You have extensive experience with Python and hands-on expertise in libraries like PyTorch and Hugging Face, and you're ready to tackle performance bottlenecks head-on.
- You've got a solid background in training large-scale models and are familiar with multi-modal applications in AI.
- Your skills include the ability to design and optimize innovative model architectures, ensuring they are efficient and impactful in production environments.
- You are familiar with the nuances of distributed training, scalable systems, and the importance of reproducibility in research.
- Collaboration is second nature for you; you thrive in a remote team where knowledge and ideas can be exchanged freely.
- You’re a self-starter who enjoys the challenge of pushing the boundaries of what's possible in AI research and implementation.
Our client is an innovative fintech firm building the infrastructure for the financial systems of tomorrow, focusing on expanding their product offerings through cutting-edge AI technologies.
Python . PyTorch . Hugging Face