Strong programming experience in Python and PySparkProficiency in SQL with expertise in complex query development and performance tuningHands on experience with AWS services including Redshift S3 EMR Glue and Glue CrawlerExpertise in building deploying and managing data pipelines in the AWS ecosystemExperience in shell scripting for automating data tasks and system interactions Preferred QualificationsExperience with AWS Lambda for serverless computingFamiliarity with other data processing frameworks like Apache Kafka Apache Airflow or similarKnowledge of Data Lakes Lakehouse architectures and modern data architecturesExperience in performance tuning of AWS Redshift PySpark and SQL queriesPrior experience working with Agile project management methodologies
Strong programming experience in Python and PySpark Proficiency in SQL with expertise in complex query development and performance tuning Hands on experience with AWS services including Redshift S3 EMR Glue and Glue Crawler Expertise in building deploying and managing data pipelines in the AWS ecosystem Experience in shell scripting for automating data tasks and system interactions
Preferred Qualifications Experience with AWS Lambda for serverless computing Familiarity with other data processing frameworks like Apache Kafka Apache Airflow or similar Knowledge of Data Lakes Lakehouse architectures and modern data architectures Experience in performance tuning of AWS Redshift PySpark and SQL queries Prior experience working with Agile project management methodologies