Researcher – Reinforcement Learning and LLM Reasoning

Huawei

Job description

Our team has a 12-month contract opening for a Researcher.

Responsibilities:

  • Conduct cutting-edge research in Natural Language Processing and Large Language Models (LLMs), focusing on advancing model reasoning.
  • Leverage reinforcement learning and deep learning to enhance the reasoning and decision-making capabilities of LLMs.
  • Design, implement, and experiment with novel methods to improve model performance and efficiency in real-world applications.
  • Publish high-quality research in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the broader machine learning community.

What you’ll bring to the team:

  • PhD or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related technical field.
  • Strong research background with a track record of publications in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL, etc.).
  • Expertise in Large Language Models (LLMs) and hands-on experience with frameworks like PyTorch or TensorFlow.
  • Hands-on experience with fine-tuning, RLHF, and applying advanced reasoning methods such as Chain of Thought and In-Context Learning.
  • Proficiency in Python and strong experience with model development and optimization.
  • Effective analytical, problem-solving, and troubleshooting skills with a focus on innovative research solutions.
  • Strong communication skills, both written and verbal, and a demonstrated ability to convey complex research findings to a variety of audiences.

Location

Montreal, QC

Job date

Fri, 20 Sep 2024 01:03:53 GMT

To apply for this job, please visit jobviewtrack.com.