View Proposal
-
Proposer
-
Alessandro Suglia
-
Title
-
ActionLLM: Developing Large Language Models that Learn To Use Tools
-
Goal
-
Developing LLMs that can access external tools via API calls
-
Description
- Current large language models like ChatGPT and Bard have remarkable abilities to generate very fluent natural language and they have achieved impressive performance on several benchmarks. However, for many relevant tasks in the real world, we have to make sure that these models can also interact with external sources (e.g., Wikipedia or search engines) as well as other external tools such as calculators, databases and so on. In this project, we will explore the field of large language models that can use external tools to achieve very specific goals and tasks.
- Resources
-
Please find below some important papers that you can use to read on this topic:
GorillaLM: https://arxiv.org/abs/2305.15334
ViperGPT: https://arxiv.org/abs/2303.08128
-
Background
-
The candidate student should be familiar with Python programming as well as Machine Learning/Deep Learning with a specific focus on Natural Language Processing
-
Url
-
-
Difficulty Level
-
Moderate
-
Ethical Approval
-
None
-
Number Of Students
-
1
-
Supervisor
-
Alessandro Suglia
-
Keywords
-
deep learning, neural networks, language models
-
Degrees
-
Master of Science in Artificial Intelligence
Master of Science in Data Science
Master of Science in Robotics