Staff AI Infrastructure Engineer
15~20K 人民币/每月
全职
5~10年
刷新于 1 年前
246 查看
51 申请
深圳
分享
工作职责
Job Responsibilities:
Identify and resolve infrastructure gaps to ensure reliable, efficient, and scalable solutions
Develop advanced AI/ML infrastructure solutions that enhance the efficiency of our skilled ML teams
Design and implement solutions for critical areas, including distributed storage systems, scheduling systems, high availability capabilities, and core reliability issues within our large-scale GPU clusters
Monitor and optimize the performance of our AI/ML infrastructure, ensuring high availability, scalability, and efficient resource utilization
Develop and deploy automation tools, monitoring solutions, and operational strategies to streamline infrastructure management and reduce manual tasks
Work with various teams, including ML developers, data engineers, and DevOps professionals, to create a cohesive and integrated AI/ML infrastructure ecosystem
职位要求
Minimum Skill Requirements:
Bachelor's degree in Computer Science, Engineering, or related technical field
5-8+ years of experience in software engineering, with a strong background in developing and managing large-scale distributed systems, ideally within the AI/ML infrastructure domain
Proficiency in programming languages such as Python, Go, or C++, with knowledge of cloud computing platforms like AWS, Azure, etc.
Strong communication and collaboration abilities, effective in working with diverse teams and individuals
Preferred Skill Requirements:
In-depth understanding of AI/ML workflows, including model training, data processing, and inference pipelines
Practical experience with containerization technologies (i.e., Docker, Kubernetes), automation tools (i.e., Ansible, Terraform), and monitoring solutions (i.e., Prometheus, Grafana)
Exceptional problem-solving skills, capable of analyzing complex systems, identifying bottlenecks, and implementing scalable solutions
A passion for continuous learning and staying abreast of new technologies and best practices in the AI/ML infrastructure space
相似的职位
搜索你理想的职位
职位类别
城市或国家
也看过
NPD(technical project manager)
15~20K 人民币/每月
全职
东莞
Agilian Technology
保存职位
0 查看
0 申请
刷新于 15 天前
Technical project manager(western preferred)
15~20K 人民币/每月
全职
深圳
Agilian Technology
保存职位
0 查看
0 申请
刷新于 15 天前
New product development manager(ME experience)
15~20K 人民币/每月
全职
东莞
Agilian Technology
保存职位
0 查看
0 申请
刷新于 15 天前
Overseas Accounting
15~20K 人民币/每月
全职
佛山
Yizumi Holdings Co., Ltd.
保存职位
0 查看
0 申请
刷新于 16 天前
The location/nature of the school: A school in Baiyun District, Guangzhou
15~20K 人民币/每月
全职
广州
Jiangsu Emily Consulting Service Co., LTD
保存职位
0 查看
0 申请
刷新于 25 天前
Brand Director
15~20K 人民币/每月
深圳
Anker Innovations LTD
保存职位
0 查看
0 申请
刷新于 1 个月前
IB primary school: math teacher
15~20K 人民币/每月
全职
广州
Jiangsu Emily Consulting Service Co., LTD
保存职位
0 查看
0 申请
刷新于 1 个月前
Shenzhen Great Benefits Homeroom Kindergarten Teacher
15~20K 人民币/每月
全职
深圳
BeneBaby Kindergarten, Futian District, Shenzhen
保存职位
0 查看
0 申请
刷新于 1 个月前
Application Development Manager
15~20K 人民币/每月
深圳
Envalior
保存职位
0 查看
0 申请
刷新于 1 个月前
Part-Time Skincare Live Streaming Host
15~20K 人民币/每月
兼职
广州
Guangzhou Yilu E-Commerce Co., Ltd.
保存职位
0 查看
0 申请
刷新于 1 个月前





