View Proposal


Proposer
Phil Bartie
Title
Text to Data Tools
Goal
Build a tool that captures text from web pages and documents, extracts information using NLP, saves the data to a database, and displays records that have locations on a web map.
Description
Websites can contain valuable data in the form of text (e.g. web text, PDFs, docx files). NLP (including LLMs) allows this data to be extracted by parsing the text to find specified items (e.g. dates, locations, names of people, tools, costs, and other values). This project will focus specifically on producing an application that performs this job for specified target tasks (e.g. finding locations where natural capital tools are used). The output would be a set of web URLs and documents (e.g. PDFs), with the corresponding values stored in a database (e.g. locations, costs, tool names, organisations using the tools). A UI to search the data could also be developed, including highlighting the spatial locations where tools are used and linking to the relevant documents.
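As a rough illustration of the extract-and-store steps described above, the following minimal sketch uses only the Python standard library. It is not the proposed design: the regular expression and the small `GAZETTEER` lookup stand in for the NLP/LLM extraction and geocoding steps, and the table name `extracted`, the function names, and the example URL are all hypothetical.

```python
import re
import sqlite3

# Hypothetical gazetteer: place name -> (latitude, longitude).
# Assumption: a real system would use an NLP/LLM step plus a geocoder instead.
GAZETTEER = {
    "Edinburgh": (55.9533, -3.1883),
    "Glasgow": (55.8642, -4.2518),
}

# Matches amounts such as "£12,500" or "£1,000.50" and captures the number part.
COST_PATTERN = re.compile(r"£\s?(\d[\d,]*(?:\.\d+)?)")


def extract_items(text):
    """Return (locations, costs) found in the text by simple pattern matching."""
    locations = [
        name for name in GAZETTEER
        if re.search(rf"\b{re.escape(name)}\b", text)
    ]
    costs = [float(m.replace(",", "")) for m in COST_PATTERN.findall(text)]
    return locations, costs


def save_document(conn, url, text):
    """Store the values extracted from one document, keyed by its source URL."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS extracted ("
        "url TEXT, kind TEXT, value TEXT, lat REAL, lon REAL)"
    )
    locations, costs = extract_items(text)
    for name in locations:
        lat, lon = GAZETTEER[name]
        conn.execute(
            "INSERT INTO extracted VALUES (?, ?, ?, ?, ?)",
            (url, "location", name, lat, lon),
        )
    for cost in costs:
        conn.execute(
            "INSERT INTO extracted VALUES (?, ?, ?, NULL, NULL)",
            (url, "cost", str(cost)),
        )
    conn.commit()


def located_rows(conn):
    """Return rows that have coordinates, i.e. those a web map could display."""
    return conn.execute(
        "SELECT url, value, lat, lon FROM extracted "
        "WHERE lat IS NOT NULL ORDER BY value"
    ).fetchall()
```

Calling `save_document` with a connection from `sqlite3.connect(":memory:")`, a source URL, and a passage of text populates the table, and `located_rows` then returns only the records that carry coordinates. Keeping the source URL on every row is one way to support linking map markers back to the relevant documents.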
Resources
LLM
Background
URL
Difficulty Level
Moderate
Ethical Approval
Full
Number Of Students
1
Supervisor
Phil Bartie
Keywords
Degrees
Bachelor of Science in Computer Science
Bachelor of Science in Computer Systems
Master of Engineering in Software Engineering
Master of Science in Data Science
Master of Science in Software Engineering