Large Model AI Infrastructure Intern
Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers. TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.
- Address issues related to computational capacity limitations, communication delays, and the scalability of distributed systems, in accordance with business needs.
- Collaborate with algorithm, hardware, and operations teams to develop efficient and stable computing infrastructure.
- Solid foundation in computer architecture; familiarity with parallel computing and the design of data-intensive systems.
- Strong mathematical skills, including linear algebra, numerical analysis, and optimization of algorithm complexity.
- Experience with inference frameworks such as TensorRT or Triton Inference Server is a plus.
- In-depth understanding of TCP/IP and RDMA protocol stacks; familiarity with DPDK/SPDK development is also beneficial.
- Those with experience in optimizing networks within supercomputing centers or cloud computing environments are preferred.
- Knowledge of the Transformer architecture and the main procedures involved in training large models.



