Sr. Cloud AI Infrastructure Engineer

Description

Conduct in-depth research into the underlying hardware logic of various AI accelerators; evaluate the power-efficiency ratio and suitability of different heterogeneous architectures in the context of Large Language Model (LLM) inference and training. Design and optimize high-performance operator libraries for large-scale cloud computing environments; resolve long-tail latency issues in hardware scheduling, memory management, and distributed communication. Define the interconnect architecture; drive the virtualization, standardized access, and efficient pooling of heterogeneous computing resources in the cloud. Monitor global trends in semiconductors and accelerators; perform feasibility studies and experimental validation for the implementation of emerging technologies within cloud infrastructure.

Responsibilities

Architecture Research: Conduct in-depth research into the underlying hardware logic of various AI accelerators; evaluate the power-efficiency ratio and suitability of different heterogeneous architectures in the context of Large Language Model (LLM) inference and training.
Operator & Performance Optimization: Design and optimize high-performance operator libraries for large-scale cloud computing environments; resolve long-tail latency issues in hardware scheduling, memory management, and distributed communication.
Interconnect Architecture Definition: Define the interconnect architecture ; drive the virtualization, standardized access, and efficient pooling of heterogeneous computing resources in the cloud.
Technology Trend Analysis: Monitor global trends in semiconductors and accelerators; perform feasibility studies and experimental validation for the implementation of emerging technologies within cloud infrastructure.

Requirements

Master’s or Ph.D. degree in Computer Engineering, Electronic Engineering, Microelectronics, or a related field.
Expertise in GPGPU architectures or other mainstream AI accelerator architectures.
Proficient in parallel computing frameworks; deep understanding of low-level operator development languages (e.g., CUDA, Triton).
Solid understanding of large-scale distributed systems, cluster topologies (e.g., Fat-tree, Torus), and high-performance network protocols.
Familiar with the architectural evolution of global leading computing enterprises; ability to objectively analyze the technical pros/cons and engineering challenges of different architectural paths.
Experience in the application, optimization, or architectural design of ultra-large-scale accelerator clusters is preferred.
Experience in the low-level adaptation and performance tuning of mainstream deep learning frameworks (e.g., PyTorch, TensorFlow) is preferred.

Benefits

Sign on payment (case-by-case basis)
Relocation package (case-by-case basis)
Restricted stock units (case-by-case basis)
Medical, dental, vision, life and disability benefits
Participation in the Company’s 401(k) plan
15 to 25 days of vacation per year (depending on tenure)
13 days of holidays throughout the calendar year
10 days of paid sick leave per year

Similar Active Jobs

Light & WonderProduct & DevelopmentMarousi, Greece

Senior Software Engineer (Java)

We are looking for an experienced Senior Software Engineer to join a high-performing agile team. You will participate in all stages of the software product development life cycle, including analyzing systems, writing Java code, and troubleshooting bugs. Ideal candidates have at least 5 years of experience in web system design and development and can lead technical discussions. The role offers competitive benefits, a supportive environment, and opportunities for career growth.

HybridFull-timeSenior5+ yearsEnglish

2026-06-18

Light & WonderProduct & DevelopmentMacau, China

Global Sr. Commercial Product Manager - Table Games

This role is for a Global Sr. Commercial Product Manager focusing on Table Games. You will lead market assessments, evaluate product performance, and translate player insights into product requirements. You will also partner with sales teams, manage installations, and serve as a subject matter expert for Table Games. Collaboration with product development, engineering, and compliance teams is crucial to ensure products meet technical and regulatory requirements across global markets. The role involves building relationships with operators, distributors, and technical partners, as well as supporting commercial proposals and RFP responses.

On-siteFull-timeSenior8-12 yearsEnglish

2026-06-18

AristocratProduct & DevelopmentNoida, India

Sr Engineer II-2

This role requires 4-7 years of experience in manual closed system testing, with a focus on digital games. You will be responsible for writing test plans, analyzing test approaches, and ensuring quality standards are met. Collaboration with product managers, designers, and engineers is key to delivering features and improvements. The position is based in Noida, India, and is a full-time, on-site role.

On-siteFull-timeSenior4-7 yearsEnglish

2026-06-18

BetwayProduct & DevelopmentCape Town, South Africa

Software Engineer (Front-End)

We are seeking passionate and driven individuals to join Super Group International on a thrilling journey of growth and innovation. As a Software Engineer (Front-End), you will build and iterate on the WTF Games frontend, create fast and reactive interfaces, and integrate with Elantil and Directus CMS. You will ship landing pages, lobby systems, and game UIs at high velocity, implement tracking, and continuously optimise UX based on real user behaviour. This role requires strong experience with React/Next.js and headless CMS platforms, along with a collaborative mindset and exceptional attention to detail.

On-siteFull-timeMid-levelEnglish

2026-06-18

BoostaProduct & DevelopmentRemote

Senior AI/ML Engineer

We’re looking for a Senior AI/ML Engineer with 3+ years of enterprise experience building real AI enterprise solutions. You will design and ship ML and agentic AI systems end‑to‑end, from quick prototypes to scalable, production‑grade solutions. You’ll work closely with product, design, and business stakeholders to lead the AI/software engineering team and translate complex AI architectures into clear, understandable user experiences.

RemoteFull-timeSenior3+ yearsEnglish

2026-06-18

Sr. Cloud AI Infrastructure Engineer

Senior Software Engineer (Java)

Global Sr. Commercial Product Manager - Table Games

Sr Engineer II-2

Software Engineer (Front-End)

Senior AI/ML Engineer

Sign in

Job Alerts