Staff AI Infrastructure Engineer

15~20K 人民币/每月

全职
5~10年
刷新于 1 年前
271 查看
51 申请
深圳
分享
工作职责
Job Responsibilities: Identify and resolve infrastructure gaps to ensure reliable, efficient, and scalable solutions Develop advanced AI/ML infrastructure solutions that enhance the efficiency of our skilled ML teams Design and implement solutions for critical areas, including distributed storage systems, scheduling systems, high availability capabilities, and core reliability issues within our large-scale GPU clusters Monitor and optimize the performance of our AI/ML infrastructure, ensuring high availability, scalability, and efficient resource utilization Develop and deploy automation tools, monitoring solutions, and operational strategies to streamline infrastructure management and reduce manual tasks Work with various teams, including ML developers, data engineers, and DevOps professionals, to create a cohesive and integrated AI/ML infrastructure ecosystem
职位要求
Minimum Skill Requirements: Bachelor's degree in Computer Science, Engineering, or related technical field 5-8+ years of experience in software engineering, with a strong background in developing and managing large-scale distributed systems, ideally within the AI/ML infrastructure domain Proficiency in programming languages such as Python, Go, or C++, with knowledge of cloud computing platforms like AWS, Azure, etc. Strong communication and collaboration abilities, effective in working with diverse teams and individuals Preferred Skill Requirements: In-depth understanding of AI/ML workflows, including model training, data processing, and inference pipelines Practical experience with containerization technologies (i.e., Docker, Kubernetes), automation tools (i.e., Ansible, Terraform), and monitoring solutions (i.e., Prometheus, Grafana) Exceptional problem-solving skills, capable of analyzing complex systems, identifying bottlenecks, and implementing scalable solutions A passion for continuous learning and staying abreast of new technologies and best practices in the AI/ML infrastructure space
搜索你理想的职位
职位类别
城市或国家
也看过
PE positions in Aug// High School PE Teachers Needed in August in Changchun city, Jilin province and Wuxi city, Jiangsu province;
15~20K 人民币/每月
全职
长春
Shanghai Bowai Education
保存职位
0 查看
0 申请
刷新于 14 天前
海外市场负责人
360~400K 人民币/每年
全职
深圳
GEOR Global Recruitment (Shenzhen) Ltd.
保存职位
猎头职位
刷新于 1 个月前
Intern Accountant 實習會計師
15~20K 人民币/每月
全职
深圳
Manulife
保存职位
0 查看
0 申请
刷新于 1 个月前
Tencent Cloud Product Manager Intern x 15 Vacancies
15~20K 人民币/每月
全职
深圳
Tencent
保存职位
0 查看
0 申请
刷新于 1 个月前
ASAP//Great training center jobs in Fengman district, Jilin City
15~20K 人民币/每月
全职
吉林
Shanghai Bowai Education
保存职位
0 查看
0 申请
刷新于 2 个月前
Mechanical Design Engineer x 3 Vacancies
15~20K 人民币/每月
全职
深圳
Daimon Robotics Technology Co., Ltd.
保存职位
0 查看
0 申请
刷新于 2 个月前
Global Marketing & Product Intern x 2 Vacancies
15~20K 人民币/每月
全职
深圳
OneOneTalk Limited
保存职位
0 查看
0 申请
刷新于 2 个月前
Finance BP
15~20K 人民币/每月
全职
滁州, 惠州
Toneluck Industrial (Hui Zhou) Ltd.
保存职位
0 查看
0 申请
刷新于 2 个月前
International Logistics Product Specialist
15~20K 人民币/每月
全职
深圳
NIUKU (SHENZHEN) INTERNATIONAL LOGISTICS LIMITED
保存职位
0 查看
0 申请
刷新于 2 个月前
Chief Equipment Engineer
15~20K 人民币/每月
全职
深圳
Jiangsu Linyang Solar Co., Ltd
保存职位
0 查看
0 申请
刷新于 3 个月前

最新博客

职位
人才
博客
我的