Senior Data Engineer
Data Integration Team – Senior Data Engineer

About the Data Integration Team (DIT)

The Data Integration Team (DIT) is responsible for onboarding and integrating healthcare data from hospitals into the IQVIA ecosystem. We design, build, and operate robust ETL pipelines that transform complex clinical data into structured, high-quality datasets used by downstream applications such as data warehouses, analytics platforms, and machine learning solutions.

Our work primarily focuses on on-premise hospital environments, interacting with a wide variety of Electronic Health Record (EHR) systems and heterogeneous data sources. We collaborate closely with the Data Management team, Product, AI/ML, and Support teams to ensure reliable data delivery and continuous improvement of our integration processes.

DIT operates at the intersection of engineering, healthcare, and operations, combining delivery, troubleshooting, and continuous optimization of ETL pipelines across multiple international projects. You will work within a standardized ETL framework used across hospital integrations, extending reusable patterns while adapting to site-specific source systems, data models, and deployment constraints.

What we are looking for

We are looking for a Senior Data Engineer who combines strong technical expertise with ownership and a proactive mindset. You will play a key role in designing and developing data integration pipelines, while also supporting ongoing operations and contributing to improvements in scalability, reliability, and maintainability.
In addition to hands-on development, we are looking for someone who can:
- Guide technical decisions
- Support and mentor other engineers
- Contribute to structuring and improving team processes

Your Responsibilities

- Design, build, and maintain ETL pipelines to integrate hospital data into the IQVIA ecosystem
- Work with complex healthcare data sources (EHR systems, flat files, APIs) and map them to target data models
- Extend and maintain shared ETL frameworks, mapping configurations, and reusable patterns across hospital projects
- Execute and monitor historical and incremental data loads, ensuring data quality and reliability
- Implement data validation and quality checks as part of pipeline design
- Troubleshoot ETL failures, performance issues, and data inconsistencies across environments
- Collaborate with cross-functional teams (DM, Product, AI/ML, Support) to align on requirements and delivery timelines
- Contribute to standardizing ETL patterns, improving monitoring, and reducing operational overhead
- Support deployments and environment setup in on-premise hospital infrastructures
- Provide technical guidance and mentoring to other team members where needed
- Participate in planning, prioritization, and continuous improvement of team processes

Operating model

This is a hands-on senior role with a mix of project delivery and production support. You will spend meaningful time building and extending pipelines, while also troubleshooting live integrations, supporting incremental and historical loads, and helping stabilize deployments in on-premise hospital environments. Remote-first collaboration is the norm; occasional access to hospital systems via VPN or on-site setup may be required depending on the project.

Your Profile

You are a hands-on engineer with a strong sense of ownership and curiosity. You enjoy working on complex data problems and navigating imperfect or evolving environments.
You are comfortable balancing delivery, support, and technical improvement, and you can clearly communicate technical concepts to both technical and non-technical stakeholders. You thrive in collaborative environments and are motivated to improve not only systems, but also the way teams work.

Requirements

- 5 years of experience in data engineering, backend engineering, or a closely related field, with significant hands-on ETL exposure
- Strong experience with Python for data engineering (modular, testable, scalable code)
- Solid experience building and maintaining ETL pipelines end-to-end in production
- Strong SQL skills (DDL/DML) and experience with relational databases (Microsoft SQL Server and/or PostgreSQL)
- Experience integrating or transforming clinical or healthcare datasets
- Experience troubleshooting data pipelines and working in production environments
- Ability to handle complex data mappings and transformations
- Experience working with on-premise environments and infrastructure constraints
- Good understanding of the data pipeline lifecycle (historical loads, incremental loads, monitoring, error handling)
- Experience designing reusable ETL patterns that others can maintain and extend
- Ability to communicate clearly with technical and non-technical stakeholders
- Experience working in collaborative teams and contributing to technical discussions
- Senior-level experience with willingness to take ownership and guide others

Nice to have

- Experience with orchestration tools (e.g. Prefect, Airflow, Luigi)
- Knowledge of healthcare data standards (FHIR, OMOP)
- Experience with monitoring and support models for data pipelines
- Familiarity with EHR systems and hospital source data models
- Experience with DevOps practices (CI/CD, automation, containerization)
- Exposure to data quality frameworks and validation strategies
- Experience with schema migrations and versioned database changes
- Experience mentoring or leading small technical initiatives

Opportunities

- Work on real-world healthcare data, contributing to improved patient outcomes and clinical insights
- Gain experience with complex, large-scale data integration challenges across multiple hospitals
- Help shape a standardized integration platform used across international hospital deployments
- Collaborate with cross-functional teams across engineering, product, analytics, and operations
- Drive improvements in ETL architecture, monitoring, and operational processes
- Work at the intersection of clinical data standards (FHIR/OMOP) and modern Python ETL engineering
- Exposure to international projects and diverse healthcare systems

SKILLS LIST:
- Python
- SQL
- Relational databases: Microsoft SQL Server, PostgreSQL
- ETL / ELT
- Data modeling & mapping
- Data pipeline lifecycle
- DevOps

IQVIA is a leading global provider of clinical research services, commercial insights and healthcare intelligence to the life sciences and healthcare industries. We create intelligent connections to accelerate the development and commercialization of innovative medical treatments to help improve patient outcomes and population health worldwide. Learn more at https://jobs.iqvia.com

IQVIA is committed to integrity in our hiring process and maintains a zero-tolerance policy for candidate fraud. All information and credentials submitted in your application must be truthful and complete.
Any false statements, misrepresentations, or material omissions during the recruitment process will result in immediate disqualification of your application, or termination of employment if discovered later, in accordance with applicable law. We appreciate your honesty and professionalism.

The potential base pay range for this role, when annualized, is zł177,500.00 - zł329,600.00. The actual base pay offered may vary based on a number of factors including job-related qualifications such as knowledge, skills, education, and experience; location; and/or schedule (full or part-time). Dependent on the position offered, incentive plans, bonuses, and/or other forms of compensation may be offered, in addition to a range of health and welfare and/or other benefits.