View Proposal
-
Proposer
-
Phil Bartie
-
Title
-
Text to Data Tools
-
Goal
-
Build a tool which can capture text from web/docs, extract information using NLP, save data to a database, display on a web map for those with locations
-
Description
- Website can contain valuable data in the form of text (e.g. web text, PDFs, docx files). NLP (inc LLMs) allows extraction of the data within the text, parsing it to find specified items (eg dates, locations, names of people, tools, costs, and other values).
This project will be specifically focus on producing an application that performs this job for specified target tasks (eg locations where natural capital tools are used). The output would be a set of web URLs, documents (eg PDFs), and the corresponding values stored in a database (e.g. locations, costs, tool names, organisation using the tools). A UI to search the data could also be developed, including highlighting spatial locations where tools are used and linking to the relevant documents.
- Resources
-
LLM
-
Background
-
-
Url
-
-
Difficulty Level
-
Moderate
-
Ethical Approval
-
Full
-
Number Of Students
-
1
-
Supervisor
-
Phil Bartie
-
Keywords
-
-
Degrees
-
Bachelor of Science in Computer Science
Bachelor of Science in Computer Systems
Master of Engineering in Software Engineering
Master of Science in Data Science
Master of Science in Software Engineering