Data Software Engineer
Job in
New York, New York County, New York, 10261, USA
Listed on 2026-01-27
Listing for:
Alldus International Consulting Ltd
Full Time
position Listed on 2026-01-27
Job specializations:
-
IT/Tech
Data Engineer, Machine Learning/ ML Engineer, Data Scientist, Data Science Manager
Job Description & How to Apply Below
Location: New York
Our client, an AI-driven organization within the Healthcare industry, is hiring a Staff Data Software Engineer to join their team in New York. The successful candidate will design, build and scale the data infrastructure that underpins agent improvement, clinical analytics and research collaboration. You will own streaming and batch pipelines to process agent conversations, clinical events and patient outcomes at scale.
Responsibilities- Build and operate streaming and batch data pipelines on Databricks using Spark and Delta Lake.
- Design, implement and maintain CDC (Change Data Capture) pipelines that sync operational databases into Delta Lake.
- Develop data mining pipelines for persona discovery, scenario extraction and edge-case detection.
- Build and own the data backend for the Research Platform, including natural-language-to-SQL capabilities.
- Implement robust data quality checks, staleness detection and automated alerting.
- Develop pipelines for voice and SMS analytics, including call quality and engagement metrics.
- Support multi-region data deployments and compliance requirements.
- Collaborate closely with agent engineers and data scientists to surface insights that improve agent performance.
- At least 4 years of experience in production data engineering roles.
- Deep, hands-on experience with Databricks, Spark and Delta Lake.
- Strong proficiency in Python and SQL for building and maintaining data pipelines.
- Experience designing and operating streaming pipelines and CDC (change data capture) systems.
- A solid understanding of data modelling, medallion architectures (bronze/silver/gold) and query optimisation.
- Experience implementing data quality frameworks, monitoring and alerting.
- A proven track record of delivering reliable, production-grade data infrastructure.
- Exposure to machine learning pipelines, including feature engineering and training infrastructure is desirable.
- Experience building natural-language query interfaces or LLM-powered data tools is a bonus.
- Experience working with healthcare data and familiarity with HIPAA compliance requirements is a plus.
- Salary: $220k - $260k
- Health, dental and vision coverage.
- Mental Health and Wellness support.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×