πŸ“ Jobs Near Me
πŸ“

HiringNearMe.work

Local Jobs, Zero Commute

πŸ“ Local Job Near You

On-prem Platform Engineer

🏒
Apolis
πŸ“ Charlotte, North Carolina, United States
πŸ“
Location Charlotte, North Carolina
πŸ“…
Posted May 16, 2026
πŸš—
Commute Local Area
🎯
Local Opportunity Near You!

This job is in your area. Enjoy a short commute and work close to home.

πŸ“‹
Job Description

On-prem Platform Engineer



Location: Charlotte, NC





Key Skills:



Must-Have Skills (Mandatory Keywords)



LLM Inference & Optimization




  • vLLM, TensorRT-LLM, Triton Inference Server, SGLang

  • Inference optimization techniques:

    • Continuous batching

    • Speculative decoding

    • KV cache / Prefix caching



  • Model optimization:

    • FP8, AWQ, GPTQ





Distributed & GPU Systems




  • Tensor parallelism and large model scaling

  • CUDA, NCCL, GPU architecture

  • GPU partitioning & optimization (MIG)



Kubernetes & ML Serving




  • Kubernetes-based ML serving...

Apply for This Job

Submit Application

Quick and secure application process

πŸ“ Location Details

πŸŒ†
City
Charlotte, North Carolina
πŸ—ΊοΈ
Country
United States
πŸš—
Commute
Local Area

πŸ” More Jobs Nearby

Explore other opportunities in Charlotte, North Carolina

View Local Jobs