Engineering Manager, Deep Learning Inference

Remote, USA Full-time
Job Description: • Lead, mentor, and scale a high-performing engineering team focused on deep learning inference and GPU-accelerated software • Drive the strategy, roadmap, and execution of NVIDIA’s inference frameworks engineering • Partner with internal compiler, libraries, and research teams to deliver end-to-end optimized inference pipelines • Oversee performance tuning, profiling, and optimization of large-scale models • Guide engineers in adopting best practices for CUDA, Triton, CUTLASS, and multi-GPU communications • Represent the team in roadmap and planning discussions • Foster a culture of technical excellence, open collaboration, and continuous innovation Requirements: • MS, PhD, or equivalent experience in Computer Science, Electrical/Computer Engineering, or a related field • 6+ years of software development experience • 3+ years in technical leadership or engineering management • Strong background in C/C++ software design and development • Proficiency in Python is a plus • Hands-on experience with GPU programming (CUDA, Triton, CUTLASS) • Proven record of deploying or optimizing deep learning models in production environments • Experience leading teams using Agile or collaborative software development practices Benefits: • Health insurance • Comprehensive benefits package Apply tot his job
Apply Now

Similar Jobs

Data Engineer - Healthcare

Remote, USA Full-time

Technical Project Manager – Robotics Hardware

Remote, USA Full-time

(Remote) Director of Applied Science - Healthcare AI

Remote, USA Full-time

[Remote] URGENT HIRING | Healthcare Customer Service Advocate - Remote

Remote, USA Full-time

Rust Developer (Train AI Models Part Time!)

Remote, USA Full-time

[Remote] Staff Product Manager, Managed Inference (SF/Sunnyvale/New York)

Remote, USA Full-time

Quality Assurance Engineer (AWS Lex and Google Dialogflow)

Remote, USA Full-time

Quality Assurance Engineer – PT, up to 20 hours per week

Remote, USA Full-time

VP, Quality Assurance, AI Tools

Remote, USA Full-time

[Remote] Cloud Solution Architect - Cloud & AI Infrastructure

Remote, USA Full-time

Data Scientist (Remote)

Remote, USA Full-time

Airport Ramp Agent PT (United/Delta) - CHO

Remote, USA Full-time

**Experienced Customer Service Representative – Remote Amazon Operations for Teens**

Remote, USA Full-time

**Experienced Manager State & Higher Education - Strategic Growth and Development at blithequark**

Remote, USA Full-time

[Remote] AI-Centric Solution Architecting for Global IT Intern - Entry Level Sales Program 2026

Remote, USA Full-time

FCC Portfolio Marketing Consultant (Remote US)

Remote, USA Full-time

**Experienced Customer Care Representative – Remote 4-Day Shift (Weekend) Phone and Chat Support**

Remote, USA Full-time

**Experienced Customer Operations Specialist – Debt Management and Financial Wellbeing**

Remote, USA Full-time

B2B Account Executive

Remote, USA Full-time

GSC: Governance Risk Manager

Remote, USA Full-time
Back to Home