View Proposal
-
Proposer
-
Phil Bartie
-
Title
-
Natural Language Navigation based on images
-
Goal
-
-
Description
-
This project will use computer vision to return a list of objects in view (e.g. from a Street View-style image). A user can then ask questions, which an LLM (large language model) will answer using the knowledge gained from the image, for the task of navigation.
The purpose is to create a service that, given an image (or set of images), can answer questions related to navigation (i.e. direction, position).
For example:
user) which way do I head now?
system) towards the building with a red door
user) the one next to the bins?
system) yes, head towards the bins and the red door; the cafe will be just after that on your left. It has green windows.
Topics:
Scene graphs, visually grounded models, LLMs, navigation tasks, UI, UX, Geographic Information Systems, location-based services
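The pipeline described above (objects detected in an image, then passed to an LLM to answer a navigation question) could be prototyped roughly as follows. This is a minimal sketch under stated assumptions, not a prescribed design: the `DetectedObject` fields, the helper names `side_of`, `describe_scene` and `build_prompt`, and the prompt wording are all hypothetical, and no particular vision model or LLM API is assumed. The sketch only shows how a detection list might be turned into text that an LLM could reason over.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DetectedObject:
    """One object reported by a (hypothetical) vision model for a single image."""
    label: str                                            # e.g. "building", "bins"
    attributes: List[str] = field(default_factory=list)   # e.g. ["red door"]
    bearing_deg: float = 0.0          # angle from the camera heading; negative = left
    distance_m: Optional[float] = None  # rough distance, if the detector provides one


def side_of(bearing_deg: float, ahead_tolerance: float = 15.0) -> str:
    """Map a bearing to a coarse direction word a pedestrian would use."""
    if abs(bearing_deg) <= ahead_tolerance:
        return "ahead"
    return "left" if bearing_deg < 0 else "right"


def describe_scene(objects: List[DetectedObject]) -> str:
    """Turn detections into one plain-text line per object, nearest first.

    Objects without a distance estimate are listed last.
    """
    ordered = sorted(
        objects,
        key=lambda o: (o.distance_m is None, o.distance_m or 0.0),
    )
    lines = []
    for obj in ordered:
        attrs = f" with {', '.join(obj.attributes)}" if obj.attributes else ""
        dist = f", about {obj.distance_m:.0f} m away" if obj.distance_m is not None else ""
        lines.append(f"- {obj.label}{attrs} ({side_of(obj.bearing_deg)}{dist})")
    return "\n".join(lines)


def build_prompt(objects: List[DetectedObject], destination: str, question: str) -> str:
    """Assemble the text that would be sent to an LLM (the call itself is out of scope)."""
    return (
        "You are helping a pedestrian navigate using only the objects listed below.\n"
        f"Objects in view:\n{describe_scene(objects)}\n"
        f"Destination: {destination}\n"
        f"User question: {question}\n"
        "Answer by referring to visible objects."
    )
```

For the example dialogue above, the detection list might contain a building with a red door, some bins and a cafe with green windows; `build_prompt(objects, "the cafe", "which way do I head now?")` would then give the LLM enough grounded context to answer in terms of those landmarks. A scene graph with explicit relations such as "next to" would be a natural extension of the flat list used here.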
-
Resources
-
-
Background
-
-
Url
-
-
Difficulty Level
-
High
-
Ethical Approval
-
None
-
Number Of Students
-
1
-
Supervisor
-
Phil Bartie
-
Keywords
-
-
Degrees
-
Bachelor of Science in Computer Science
Bachelor of Science in Computer Systems
Master of Engineering in Software Engineering
Master of Science in Artificial Intelligence
Master of Science in Data Science