📍 Local Job Near You
Senior AI Platform Engineer (Hybrid in NYC or CT)
Insight Global
📍
New York, United States
Location
New York
Posted
June 10, 2026
Commute
Local Area
Local Opportunity Near You!
This job is in your area. Enjoy a short commute and work close to home.
Job Description
Job Description
We are seeking a Senior AI Platform Engineer to design, build, and operate scalable AI/ML platform infrastructure on AWS, with a strong emphasis on platform reliability, visibility, and observability.
In this role, you will enable data scientists and application teams to safely deploy and operate AI workloads by providing resilient infrastructure, standardized tooling having deep operational insight across environments.
This is a hands-on senior engineering role that blends cloud infrastructure, DevOpsSec principles, and AI platform enablement.
Responsibilities
AI Platform & AWS Infrastructure
Design, build, and operate a cloud‑native AI/ML platform on AWS supporting training, inference, and experimentation workloads, spanning orchestration layers, agents, MCP tools, internal APIs, and databases.
Build and maintain core multi‑tenant services that enable the development, testing, deployment, monitoring, and lifecycle management of LLM‑bas...
We are seeking a Senior AI Platform Engineer to design, build, and operate scalable AI/ML platform infrastructure on AWS, with a strong emphasis on platform reliability, visibility, and observability.
In this role, you will enable data scientists and application teams to safely deploy and operate AI workloads by providing resilient infrastructure, standardized tooling having deep operational insight across environments.
This is a hands-on senior engineering role that blends cloud infrastructure, DevOpsSec principles, and AI platform enablement.
Responsibilities
AI Platform & AWS Infrastructure
Design, build, and operate a cloud‑native AI/ML platform on AWS supporting training, inference, and experimentation workloads, spanning orchestration layers, agents, MCP tools, internal APIs, and databases.
Build and maintain core multi‑tenant services that enable the development, testing, deployment, monitoring, and lifecycle management of LLM‑bas...