Machine Learning - Model Serving Job at Alexander Chapman, San Francisco, CA

  • Alexander Chapman
  • San Francisco, CA

Job Description

We are working with a company building intuitive, voice-first AI systems that blend natural interaction with powerful model performance. Founded by leaders from Meta, Oculus, and Google, they’re creating a new class of consumer devices powered by speech, vision, and LLMs.

The Role

You’ll help optimize and scale the inference stack, working across model serving, performance tuning, and deployment to support real-time, multimodal AI.

What You’ll Do

  • Improve serving systems for LLMs, speech, and vision models.
  • Optimize throughput, latency, and cost using techniques such as batching, caching, and kernel tuning.
  • Extend frameworks like vLLM or SGLang to push the limits of performance.
  • Collaborate with training teams to deploy faster, lighter models.
  • Experiment with compilers and hardware backends to boost efficiency.

What We’re Looking For

  • Strong experience with PyTorch or similar ML frameworks.
  • Deep knowledge of model serving and systems performance.
  • Skill in low-level debugging, bottleneck analysis, and server optimization.
  • Familiarity with vLLM, Ray, or deploying inference workloads at scale.
  • Comfortable owning complex infrastructure projects end to end.
  • Background in computer science or a related field from a top-tier university (e.g., Stanford, MIT, Ivy League).
  • Experience at a top tech company (e.g. FAANG) or a successful, high-growth startup.

They’re looking for curious, impact-driven engineers ready to push what’s possible with real-time AI.
