Hursley, Winchester SO21, UK
Posted: Mar 06, 2025
HybridInternship
Find Similar Recent Jobs
Share Job
Job Description
Introduction

The Responsible Tech team's research is focused on the intersection of technology and society. They study and devise approaches to mitigate technology risks across research and development processes, and foster innovations that expand societal benefits of technology.

Your role and responsibilities

We are seeking a motivated intern with background in computer science, artificial intelligence, applied mathematics, computational linguistics, or a related field to leverage state-of-the-art reinforcement learning techniques for language model alignment. In this role, you will have a freedom to explore multiple research directions: developing sophisticated reward models that capture human values by effectively decomposing complex value alignment into learnable components; creating synthetic training data through policy-guided rejection sampling; and implementing RL for alignment. We're particularly interested in approaches that can leverage human feedback efficiently, scale to large language models, and provide verifiable alignment guarantees. Candidates currently enrolled in graduate programs are encouraged to apply.

Required education

Bachelor's Degree

Required technical and professional expertise

  • Strong experience in deep RL
  • Familiarity with language models and alignment challenges
  • Machine Learning
  • Advanced experience with Python, PyTorch, TensorFlow
  • Experience with reward modeling and synthetic data generation is highly valued

Preferred Technical And Professional Experience

  • Cloud based computation
  • Hands on experience with fine tuning of large language models
  • Experience working with Huggingface models and data

About Business Unit

IBM Research is the organic growth engine of IBM and an innovation engine for our customers and partners. As part of this mission, IBM Research anticipates and examines 'What's Next in Computing' to ultimately create and integrate the technologies the world relies upon to solve big challenges and unlock new opportunities. We create and pioneer new markets for IBM, our partners and customers as exemplified in our ongoing quest to reach practical and large-scale quantum computing. Across IBM Research, we realize the power and potential to accelerate discovery with our partners and clients by combining the power of high performance computing, AI, and Quantum, all integrated through the hybrid cloud.

YOUR LIFE @ IBM

In a world where technology never stands still, we understand that, dedication to our clients success, innovation that matters, and trust and personal responsibility in all our relationships, lives in what we do as IBMers as we strive to be the catalyst that makes the world work better.

Being an IBMer means you’ll be able to learn and develop yourself and your career, you’ll be encouraged to be courageous and experiment everyday, all whilst having continuous trust and support in an environment where everyone can thrive whatever their personal or professional background.

Our IBMers are growth minded, always staying curious, open to feedback and learning new information and skills to constantly transform themselves and our company. They are trusted to provide on-going feedback to help other IBMers grow, as well as collaborate with colleagues keeping in mind a team focused approach to include different perspectives to drive exceptional outcomes for our customers. The courage our IBMers have to make critical decisions everyday is essential to IBM becoming the catalyst for progress, always embracing challenges with resources they have to hand, a can-do attitude and always striving for an outcome focused approach within everything that they do.

Are you ready to be an IBMer?

About Ibm

IBM’s greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.

Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we’re also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business.

At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it’s time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.

Other Relevant Job Details

For additional information about location requirements, please discuss with the recruiter following submission of your application.

Job Title

Research Scientist—Responsible Technologies Intern: 2025

Job ID

15859

City / Township / Village

Hursley

State / Province

Hampshire

Country

United Kingdom

Work arrangement

Hybrid

Area of work

Research

Employment type

Intern (Fixed Term)

Position type

Internship

Travel required

No Travel

Company

(8660) IBM United Kingdom Limited

Shift

General (daytime)

Is this role a commissionable/sales incentive based position?

No

Apply now

POST data

Note: Since your browser does not support JavaScript, you must press the button below once to proceed.
Share Job
Additional Info
Min. Experience
-
Job Location Type
Hybrid
Job Type
Internship
Predicted Seniority Level
Internship
Job Duration
-
Language(s)
All
Est. Budget/Pay Rate
To be discussed
Est. hrs
-
Optimal Job Start
-
Job Industry
Software Development
Job Function
Other
Employer
IBM Canada Software Lab - Toronto
Preferred Applicant Countries
United Kingdom
Job Ref
J25B48454AC47
Disclaimer: The jobs on Lifelancer Platform are posted by employers, recruiters and Lifelancer. We aim to provide value to Talent users by connecting them with a maximum number of curated job opportunities. However, Lifelancer doesn't represent job-offering companies and is not responsible for job advert content or updates. We aim to connect you with the employer, job source, or application link to the best of our ability. You can apply for this job on the company website.