Researcher – Reinforcement Learning and LLM Reasoning

Job title:

Researcher – Reinforcement Learning and LLM Reasoning

Company

Huawei

Job description

Our team has a 12-month contract opening for a Researcher.

Responsibilities:

  • Conduct cutting-edge research in the field of Natural Language Processing / Large Language Models (LLMs), focusing on advancing model reasoning.
  • Leverage reinforcement learning and deep learning to enhance the reasoning capabilities and decision-making of LLMs.
  • Design, implement, and experiment with novel methods to improve model performance and efficiency in real-world applications.
  • Publish high-quality research in top-tier AI/ML conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the broader machine learning community.

What you’ll bring to the team:

  • PhD or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, Mathematics, or a related technical field.
  • Strong research background with a track record of publications in top-tier AI conferences (NeurIPS, ICML, ICLR, ACL, etc.).
  • Expertise in Large Language Models (LLMs) and hands-on experience with frameworks like PyTorch or TensorFlow.
  • Hands-on experience with fine-tuning, RLHF, and applying advanced reasoning methods such as Chain of Thought and In-Context Learning.
  • Proficiency in Python and strong experience with model development and optimization.
  • Effective analytical, problem-solving, and troubleshooting skills with a focus on innovative research solutions.
  • Strong communication skills, both written and verbal, and a demonstrated ability to convey complex research findings to a variety of audiences.

Expected salary

Location

Montreal, QC

Job date

Fri, 20 Sep 2024 01:03:53 GMT

