Researcher – Reinforcement Learning and LLM Reasoning

Contract
Montreal, QC
Posted 2 months ago

Huawei

Job description

Our team has a 12-month contract opening for a Researcher.

Responsibilities:

Conduct cutting-edge research in Natural Language Processing and Large Language Models (LLMs), focusing on advancing model reasoning.

Leverage reinforcement learning and deep learning to enhance the reasoning and decision-making capabilities of LLMs.

Design, implement, and experiment with novel methods to improve model performance and efficiency in real-world applications.

Publish high-quality research in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the broader machine learning community.

What you’ll bring to the team:

PhD or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related technical field.

Strong research background with a track record of publications in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL, etc.).

Expertise in Large Language Models (LLMs) and hands-on experience with frameworks like PyTorch or TensorFlow.

Hands-on experience with fine-tuning, RLHF, and applying advanced reasoning methods such as Chain of Thought and In-Context Learning.

Proficient programming skills in Python and strong experience with model development and optimization.

Effective analytical, problem-solving, and troubleshooting skills with a focus on innovative research solutions.

Strong communication skills, both written and verbal, and a demonstrated ability to convey complex research findings to a variety of audiences.

Location

Montreal, QC

Job date

Fri, 20 Sep 2024 01:03:53 GMT
