Remote AI Model Inference Engineer — Vulkan & On-Device GPU

Tether.io
Full-time Switzerland, Switzerland Other-General
Posted:
February 21, 2026
Location:
Switzerland, Switzerland, Switzerland

Job Description

Overview

Senior AI Research Engineer, Model Inference (Remote) - Tether.io

About the job

We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine-tuning for Language models with a strong focus on mobile and integrated GPU acceleration (Vulkan).

This role requires hands-on experience with quantization techniques, LoRA architectures, Vulkan backend, and mobile GPU debugging. You will play a critical role in pushing the boundaries of desktop and on-device inference and fine-tuning performance for next-generation SLM/LLMs.

Responsibilities

  • Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.
  • Implement and optimize full and LoRA fine-tuning for small and large languag...

Apply for this Job

Submit your application for the Remote AI Model Inference Engineer — Vulkan & On-Device GPU position at Tether.io.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: Switzerland, Switzerland
Posted: February 21, 2026
Deadline: April 02, 2026