Role Summary

Architect, design, develop, and maintain Starburst data pipelines and workflows to integrate and analyse data from various sources such as MSSQL, MongoDB (complex collections), Hive, Amazon S3, and enterprise data sources.
Collaborate with data engineers, domain product owners, analysts, and other stakeholders to understand data requirements and develop optimal solutions on Starburst Enterprise Platform (SEP).
Optimize Starburst queries, configurations, and performance to ensure efficient data processing and retrieval.
Author and maintain detailed documentation for data pipelines, configurations, tuning parameters, and processes for future maintenance.
Troubleshoot and resolve issues related to data integration, entitlements, and performance.
Refer to upstream domain data models to design appropriate data integrations.

Mandatory skills

Proven experience as a Starburst developer, integrating various relational and NoSQL sources, including complex MongoDB collections.
Familiarity with data integration, ETL processing, and data warehousing concepts.
Basic hands-on knowledge and understanding of MS SQL Server/Oracle, MongoDB, S3, and Hive.
Exposure to data modelling: dimensional/reporting, data warehousing, and transactional use cases (physical and logical modelling).
Knowledge of reporting and query tools and practices.

Preferred skills

10+ years of experience in Data Mesh/Starburst, architecture, design, design patterns, data-driven transformations, artificial intelligence, data governance and management, and cloud technologies.
3+ years of experience in development on the Starburst technology stack.

Design the overall data architecture, defining all domains, schemas, and integration patterns that support the principles of Data Mesh.
Collaborate with data engineers to implement scalable and flexible data pipelines and storage solutions that enable seamless data integration and interoperability.
Provide guidance and expertise on data modelling, metadata management, and data governance practices.
Experience in managing data as a product, defining its vision, roadmap, and prioritization based on business objectives.
Collaborate with stakeholders to understand their requirements and ensure that products deliver value to the customer.
Monitor performance metrics and gather feedback.
10+ years of experience in Data Mesh/Starburst, architecture, design, design patterns, data-driven transformations, artificial intelligence, data governance and management, and cloud technologies.
Data Engineering: strong SQL, Apache Spark, Python, Hive, Iceberg, Big Data ecosystem, Starburst/Trino, Airflow, Kafka.
AWS services related to data and analytics; implementing Data Lakehouse solutions in AWS.
Working experience with highly available and scalable systems.
Data strategy, Azure AI, data governance, data platform, asset rationalization, migration strategy, technology landscape assessment.