2,614 Open roles
98 Companies
54 Posted today
Jobs / Tencent Games / Research Internship – Reinforcement Learning for Large Foundation Models
This job is no longer available.

This position has been closed.

Posted 2026-05-22

Research Internship – Reinforcement Learning for Large Foundation Models

Description

Tencent AI Lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. The lab's long-term ambition is to drive the development of Artificial General Intelligence (AGI), and ultimately, Artificial Superintelligence (ASI). We are currently seeking research interns for the year of 2026, in the area of reinforcement learning (RL) for large foundation models, with an emphasis on developing stable and efficient RL algorithms. The goal is to empower large foundation models in complex reasoning ang agent tasks and enhance their capabilities in autonomous exploration and continuous learning. Our Seattle area office is located in Bellevue WA.

Every research intern will work with researchers on a research project aimed at attacking one of the core problems on the design and optimization of RL algorithms for large foundation models. Research areas include but are not limited to Reinforcement Learning Algorithms, Reward Modeling, and World Models. We will conduct large-scale experiments of RL algorithms in scenarios such as complex reasoning and autonomous agents, deliver impactful algorithms for real world applications, and publish influential research papers.

Responsibilities
  • Conduct large-scale experiments of RL algorithms in scenarios such as complex reasoning and autonomous agents.
  • Deliver impactful algorithms for real world applications.
  • Publish influential research papers.
Requirements
  • Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university.
  • Self-motivated and excited about developing novel techniques.
  • Research experiences in natural language processing or machine learning.
  • Proficient in Python programming and experienced in developing with deep learning frameworks such as PyTorch.
  • Good publication track records and history of creativity and intellectual flexibility.
  • Excellent communication and teamwork skills, capable of collaborating with cross-functional teams to drive project success and innovation.
Benefits
  • 3 months duration (with the possibility of extension).
  • Eligible for 1 hour of paid sick leave for every 30 hours worked.
  • Up to 13 paid holidays throughout the calendar year.
  • Eligible to enroll in the Company-sponsored medical plan.
Similar Active Jobs
Light & WonderProduct & DevelopmentMarousi, Greece

Senior Software Engineer (Java)

We are looking for an experienced Senior Software Engineer to join a high-performing agile team. You will participate in all stages of the software product development life cycle, including analyzing systems, writing Java code, and troubleshooting bugs. Ideal candidates have at least 5 years of experience in web system design and development and can lead technical discussions. The role offers competitive benefits, a supportive environment, and opportunities for career growth.

HybridFull-timeSenior5+ yearsEnglish
2026-06-18
Light & WonderProduct & DevelopmentMacau, China

Global Sr. Commercial Product Manager - Table Games

This role is for a Global Sr. Commercial Product Manager focusing on Table Games. You will lead market assessments, evaluate product performance, and translate player insights into product requirements. You will also partner with sales teams, manage installations, and serve as a subject matter expert for Table Games. Collaboration with product development, engineering, and compliance teams is crucial to ensure products meet technical and regulatory requirements across global markets. The role involves building relationships with operators, distributors, and technical partners, as well as supporting commercial proposals and RFP responses.

On-siteFull-timeSenior8-12 yearsEnglish
2026-06-18
AristocratProduct & DevelopmentNoida, India

Sr Engineer II-2

This role requires 4-7 years of experience in manual closed system testing, with a focus on digital games. You will be responsible for writing test plans, analyzing test approaches, and ensuring quality standards are met. Collaboration with product managers, designers, and engineers is key to delivering features and improvements. The position is based in Noida, India, and is a full-time, on-site role.

On-siteFull-timeSenior4-7 yearsEnglish
2026-06-18
BetwayProduct & DevelopmentCape Town, South Africa

Software Engineer (Front-End)

We are seeking passionate and driven individuals to join Super Group International on a thrilling journey of growth and innovation. As a Software Engineer (Front-End), you will build and iterate on the WTF Games frontend, create fast and reactive interfaces, and integrate with Elantil and Directus CMS. You will ship landing pages, lobby systems, and game UIs at high velocity, implement tracking, and continuously optimise UX based on real user behaviour. This role requires strong experience with React/Next.js and headless CMS platforms, along with a collaborative mindset and exceptional attention to detail.

On-siteFull-timeMid-levelEnglish
2026-06-18
BoostaProduct & DevelopmentRemote

Senior AI/ML Engineer

We’re looking for a Senior AI/ML Engineer with 3+ years of enterprise experience building real AI enterprise solutions. You will design and ship ML and agentic AI systems end‑to‑end, from quick prototypes to scalable, production‑grade solutions. You’ll work closely with product, design, and business stakeholders to lead the AI/software engineering team and translate complex AI architectures into clear, understandable user experiences.

RemoteFull-timeSenior3+ yearsEnglish
2026-06-18