Overview
This proposed PhD research aims to conduct an in-depth investigation into the security and privacy risks associated with large language models (LLMs). As LLMs are deployed across a growing range of domains, including security-sensitive ones, understanding and mitigating potential threats is paramount. The focus will be on studying jailbreaks, evasion of safeguards, and data privacy attacks, with the ultimate goal of proposing robust defensive strategies towards safe LLMs.
Research Objectives:
1. Jailbreak Analysis:
– Examine the vulnerabilities that may lead to the compromise of LLMs, allowing unauthorised access or manipulation of their underlying systems.
– Investigate the implications of jailbreaks in terms of model integrity, user trust, and the broader security landscape.
2. Safeguard Evasion:
– Explore methods by which existing LLM security safeguards can be bypassed or evaded, such as adversarial attacks or exploitation of weaknesses in protective mechanisms.
– Assess the effectiveness of current safeguards and identify areas for improvement to enhance resilience against evasion attempts.
3. Data Privacy Attacks:
– Investigate the risks posed by LLMs in compromising user data privacy, both in terms of input data and generated output.
– Analyse potential scenarios where LLMs might inadvertently leak sensitive information and assess the magnitude of privacy threats.
4. Defensive Strategies:
– Devise and propose advanced defensive mechanisms to safeguard LLMs against jailbreaks, safeguard evasion, and data privacy attacks.
– Evaluate the efficacy of the proposed defences through simulations, empirical studies, and real-world scenarios.
Significance:
This research will contribute to the growing body of knowledge on LLM security and privacy, providing insights that can inform the development of safer and more secure language models. The proposed defensive strategies aim to mitigate the identified risks, fostering the responsible deployment of LLMs in diverse applications.
Funding Information
To be eligible for consideration for a Home DfE or EPSRC Studentship (covering tuition fees and maintenance stipend of approx. £19,237 per annum), a candidate must satisfy all the eligibility criteria based on nationality, residency and academic qualifications.
To be classed as a Home student, candidates must meet the following criteria and the associated residency requirements:
• Be a UK National, or
• Have settled status, or
• Have pre-settled status, or
• Have indefinite leave to remain or enter the UK.
Candidates from ROI may also qualify for Home student funding.
Previous PhD study MAY make you ineligible to be considered for funding.
Please note that other terms and conditions also apply.
Please note that any available PhD studentships will be allocated on a competitive basis across a number of projects currently being advertised by the School.
A small number of international awards will be available for allocation across the School. An international award is not guaranteed to be available for this project, and competition across the School for these awards will be strong.
Academic Requirements:
The minimum academic requirement for admission is normally an Upper Second Class Honours degree from a UK or ROI Higher Education provider in a relevant discipline, or an equivalent qualification acceptable to the University.
Entrance requirements
Graduate
The minimum academic requirement for admission to a research degree programme is normally an Upper Second Class Honours degree from a UK or ROI HE provider, or an equivalent qualification acceptable to the University. Further information can be obtained by contacting the School.
International Students
For information on international qualification equivalents, please check the specific information for your country.
English Language Requirements
Evidence of an IELTS* score of 6.0, with not less than 5.5 in any component, or an equivalent qualification acceptable to the University, is required (*taken within the last 2 years).
International students wishing to apply to Queen’s University Belfast, and for whom English is not their first language, must be able to demonstrate their proficiency in English in order to benefit fully from their course of study or research. Non-EEA nationals must also satisfy the UK Visas and Immigration (UKVI) English language requirements for visa purposes.
For more information on English Language requirements for EEA and non-EEA nationals see: www.qub.ac.uk/EnglishLanguageReqs.
If you need to improve your English language skills before you enter this degree programme, INTO Queen’s University Belfast offers a range of English language courses. These intensive and flexible courses are designed to improve your English ability for admission to this degree.
How to Apply
Apply using our online Postgraduate Applications Portal and follow the step-by-step instructions on how to apply.
Find a supervisor
If you’re interested in a particular project, we suggest you contact the relevant academic before you apply, to introduce yourself and ask questions.
To find a potential supervisor aligned with your area of interest, or if you are unsure of who to contact, look through the staff profiles linked here.
You might be asked to provide a short outline of your proposal to help us identify potential supervisors.