Data Engineer Position SummaryThe Data Engineer is responsible for building and maintaining data pipelines ensuring the smooth operation of data systems and optimizing workflows to meet business requirements This role will support data integration and processing for various applicationsMinimum Qualifications6 Years overall IT experience with minimum 4 years of work experience in below tech skillsTech Skills Proficient in Python scripting and PySpark for data processing tasksStrong SQL capabilities with hands on experience managing big data using ETL tools like InformaticaExperience with the AWS cloud platform and its data services including S3 Redshift Lambda EMR Airflow Postgres SNS and EventBridgeSkilled in BASH Shell scriptingUnderstanding of data lakehouse architecture particularly with Iceberg format is a plusPreferred Experience with Kafka and Mulesoft APIUnderstanding of healthcare data systems is a plusExperience in Agile methodologiesStrong analytical and problem solving skillsEffective communication and teamwork abilitiesResponsibilitiesDevelop and maintain data pipelines and ETL processes to manage large scale datasetsCollaborate to design test data architectures to align with business needsImplement and optimize data models for efficient querying and reportingAssist in the development and maintenance of data quality checks and monitoring processesSupport the creation of data solutions that enable analytical capabilitiesContribute to aligning data architecture with overall organizational solutions
Data Engineer
Position Summary The Data Engineer is responsible for building and maintaining data pipelines ensuring the smooth operation of data systems and optimizing workflows to meet business requirements This role will support data integration and processing for various applications
Minimum Qualifications 6 Years overall IT experience with minimum 4 years of work experience in below tech skills Tech Skills Proficient in Python scripting and PySpark for data processing tasks Strong SQL capabilities with hands on experience managing big data using ETL tools like Informatica Experience with the AWS cloud platform and its data services including S3 Redshift Lambda EMR Airflow Postgres SNS and EventBridge Skilled in BASH Shell scripting Understanding of data lakehouse architecture particularly with Iceberg format is a plus Preferred Experience with Kafka and Mulesoft API Understanding of healthcare data systems is a plus Experience in Agile methodologies Strong analytical and problem solving skills Effective communication and teamwork abilities
Responsibilities Develop and maintain data pipelines and ETL processes to manage large scale datasets Collaborate to design test data architectures to align with business needs Implement and optimize data models for efficient querying and reporting Assist in the development and maintenance of data quality checks and monitoring processes Support the creation of data solutions that enable analytical capabilities Contribute to aligning data architecture with overall organizational solutions