Make AI run on
every device

We're a research group building the future of on-device AI. Private, fast, and free from cloud dependency.

Open Roles

AI Talent Product

Define and shape edge AI products that bring powerful, private AI directly to users' devices.

Beijing, China / Hefei Anhui, China

We Hope You

  • Have an engineering background
  • Deeply understand the edge AI landscape — cloud vs on-device trade-offs, privacy-first design, and user-centric AI
  • Love reading — stay current with AI research and industry trends
  • Have read at least 30 AI papers (100+ if born after 1998)
  • Can translate cutting-edge research into practical product roadmaps
  • Passionate about making AI accessible, private, and free for everyone
Apply Now

Edge AI Research Engineer

Research and develop model compression techniques to make state-of-the-art AI models run efficiently on consumer devices.

Beijing, China / Hefei Anhui, China

Tech Stack

PythonPyTorchMLXONNX

Focus

  • Model compression: quantization (FP16→INT8→INT4), structured/unstructured pruning, and knowledge distillation.
  • Designing efficient model architectures optimized for edge deployment (1B–4B parameters).
  • Benchmarking inference performance across consumer hardware (Apple Silicon, Snapdragon, etc.).
  • Reproducing and improving state-of-the-art methods from deep learning literature.
  • Publishing research and contributing to open-source projects.

Ideal Experience

  • Deep understanding of Transformer architecture and large language models.
  • Hands-on experience with model compression techniques (quantization, pruning, distillation).
  • Familiarity with efficient architectures: MobileLLM, Phi, Gemma, and similar.
  • Performance profiling and optimization on resource-constrained devices.
  • Track record of open-source contributions or published research.
Apply Now

Apple MLX Engineer

Optimize the MLX framework and build high-performance ML infrastructure on Apple Silicon.

Beijing, China / Hefei Anhui, China

Tech Stack

PythonMLXMetalSwiftC++

Focus

  • Optimizing MLX framework internals for maximum performance on M-series chips.
  • Memory optimization: mmap acceleration, zero-copy model loading, unified memory utilization.
  • Integrating MLX with Apple frameworks (Core ML, Metal Performance Shaders).
  • Profiling and debugging ML workloads on Apple Silicon.
  • Building inference pipelines for on-device LLM, VLM, and TTS models.

Ideal Experience

  • Hands-on development with the MLX framework.
  • Low-level performance optimization on Apple hardware (M1–M5).
  • GPU programming with Metal or similar compute APIs.
  • Memory management and system-level optimization.
  • Contributing to open-source ML frameworks.
Apply Now

Full Stack Engineer

Build web platforms, research tooling, and developer experiences for AtomGradient's open-source ecosystem.

Beijing, China / Hefei Anhui, China

Tech Stack

ReactTypeScriptRustPythonNext.js

Focus

  • Building internal research tools, benchmarking dashboards, and model evaluation platforms.
  • Developing and maintaining AtomGradient's web presence and documentation sites.
  • Creating developer-facing tools for the open-source community.
  • End-to-end application development from backend services to frontend interfaces.

Ideal Experience

  • Building professional-grade web applications with React and TypeScript.
  • Backend service development in Rust or Python.
  • Experience with developer tools, documentation systems, or open-source project infrastructure.
  • Deploying and monitoring production web applications.
  • Interest in AI/ML and developer experience.
Apply Now

Don't see a fit?

We're always interested in hearing from people passionate about on-device AI and open-source research.

Get in touch