Senior AI Research Engineer, Model Inference (Remote)

Tether.io
Full-time , , Pakistan, , , Pakistan Engineering
Posted:
February 26, 2026
Location:
, , Pakistan, , , Pakistan, Pakistan

Job Description

Senior AI Research Engineer, Model Inference (Remote)

Join to apply for the Senior AI Research Engineer, Model Inference (Remote) role at Tether.io

Overview

We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine-tuning for Language models with a strong focus on mobile and integrated GPU acceleration (Vulkan).

About the job

We are seeking hands-on experience with quantization techniques, LoRA architectures, Vulkan backend, and mobile GPU debugging. You will play a critical role in pushing the boundaries of desktop and on-device inference and fine-tuning performance for next-generation SLM/LLMs.

Responsibilities

  • Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware...

Apply for this Job

Submit your application for the Senior AI Research Engineer, Model Inference (Remote) position at Tether.io.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: , , Pakistan, Pakistan
Posted: February 26, 2026
Deadline: April 07, 2026