Research Intern - Systems For Efficient AI
Microsoft
United States of America
Summary
Explore research internships in a global tech company's systems for AI inference. Contribute to end-to-end AI pipelines, scheduling, batching, KV caching, and GPU fleet orchestration. Collaborate across teams to advance efficient AI, reduce latency, and optimize compute for large-scale models.