AI Inference Engineer Job at Signify Technology, Santa Clara, CA

  • Signify Technology
  • Santa Clara, CA

Job Description

AI Inference Engineer – Stealth Startup | San Francisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
