As a Generative AI Platform Support Engineer, you will be responsible for providing technical support for our AI platform, focusing on the integration of cloud infrastructure, deployment, and ongoing maintenance. You will work closely with cross-functional teams to troubleshoot technical issues, implement platform enhancements, monitor system performance, and ensure the platform runs efficiently and effectively. Your role will leverage expertise in AWS cloud administration and infrastructure management to support platform operations and ensure optimal system performance.

Key Responsibilities

- Assess and enhance the AI platform's cloud infrastructure and data pipeline resilience using AWS and cloud-based technologies.
- Ensure scalability and fault tolerance of AI/ML models within cloud environments.
- Identify and resolve bottlenecks in model inference and training pipelines, focusing on performance and resource optimization.
- Optimize cloud resource utilization on AWS for real-time use cases, including AI model deployment.
- Collaborate with the DevOps team on improving cloud deployment processes and managing AWS infrastructure.
- Implement automated testing to simulate faults, verify fault tolerance, and ensure high availability.
- Provide ongoing technical support for users of the Generative AI platform, troubleshooting issues and responding to queries to ensure seamless operations.
- Monitor cloud platform performance on AWS, identifying and implementing optimization strategies to improve cost efficiency and scalability.
- Work with AWS cloud services (e.g., EC2, S3, Lambda, VPC) to ensure proper configuration, management, and performance.
- Document key processes, issues, and solutions for knowledge sharing and future reference.
- Stay updated with industry trends in Generative AI, cloud technologies, and AWS cloud administration.