More jobs:
Pharmaceutical Data Engineer; GMP & Clinical Manufacturing
Job in
Indiana Borough, Indiana County, Pennsylvania, 15705, USA
Listed on 2026-02-28
Listing for:
CloudIngest
Full Time
position Listed on 2026-02-28
Job specializations:
-
IT/Tech
Data Engineer
Job Description & How to Apply Below
Role Overview
The Data Engineer will be responsible for designing, building, and maintaining data pipelines that extract data from source systems, transform it through the medallion architecture layers, and prepare it for consumption by the analytics layer. This role will work closely with the Power BI Developer to ensure data is properly structured, documented, and accessible for reporting and analytics.
Key Responsibilities- Following the Enterprise
DB (EDB) standard, design and establish AWS S3 bucket structure for data lake (Bronze/Silver/Gold zones) with Red CCI security controls - Build new data pipelines to extract data from the selected SaaS-based HSE systems
- Enhance/extend data pipelines using the dataset from AWS EDB Asset-related data products and digital solutions
- MES (Manufacturing Execution System) selection to be finalized by the end of Jan 2027. Need to build a data ingestion pipeline for selected MES via REST API (Lambda-based extraction). If a different MES is chosen, change course accordingly
- Data pipeline to SaaS based HSE tools to integrate data into the data warehouse
- Implement CDC pipeline for Lab Vantage LIMS using AWS DMS from Oracle database (or Azure Data Factory)
- Develop Bronze-to-Silver transformations using AWS Glue or Azure Data Factory, depending on data domains
- Configure AWS Glue Data Catalog with appropriate metadata and Red CCI classification tags
- Connect to the EDB marketplace for enterprise reference data
- Build Silver-to-Gold transformations, creating batch-centric data products
- Implement data quality checks and monitoring dashboards
- Develop orchestration workflows using AWS Step Functions
- Support the Power BI Developer in validating One Lake shortcut connectivity to the Gold zone
- Document data lineage, schema definitions, and pipeline architecture
- Monitor pipeline health, troubleshoot failures, and optimize performance
- Collaborate with the Power BI Developer on data model requirements and data quality issues
- Support data governance and security compliance reviews
- Respond to ad-hoc data requests from engineers
- Coordinate with the enterprise EDB team on data sharing agreements and standards
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×