View Proposal


Proposer
Pierre Le Bras
Title
Building a Corpus Analysis Pipeline in Rust
Goal
To develop and evaluate a text corpus processing pipeline (loading, parsing, cleaning, modelling, analysing) using the Rust programming language
Description
While Python has been the de facto language for data processing and analysis for years, other languages and their features have seemingly been overlooked. This project aims to investigate how well the Rust programming language performs when used to build a configurable text corpus analysis pipeline.
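As a rough illustration of what such a pipeline might look like, the sketch below chains the stages named in the Goal (parsing, cleaning, modelling, analysing) using only the Rust standard library. It is not part of the proposal: `PipelineConfig`, its fields, and the functions `parse`, `clean`, `model`, `top_terms`, and `run_pipeline` are hypothetical names chosen for this example, loading is stood in for by an in-memory slice of strings, and the "model" is assumed to be a simple bag-of-words term count.

```rust
use std::collections::{HashMap, HashSet};

/// Hypothetical configuration for this sketch; the field names are illustrative.
pub struct PipelineConfig {
    pub lowercase: bool,
    pub min_token_len: usize,
    pub stop_words: HashSet<String>,
}

/// Parsing: split a raw document into alphanumeric tokens.
pub fn parse(doc: &str) -> Vec<String> {
    doc.split(|c: char| !c.is_alphanumeric())
        .filter(|t| !t.is_empty())
        .map(str::to_string)
        .collect()
}

/// Cleaning: optionally lowercase, then drop short tokens and stop words.
pub fn clean(tokens: Vec<String>, cfg: &PipelineConfig) -> Vec<String> {
    tokens
        .into_iter()
        .map(|t| if cfg.lowercase { t.to_lowercase() } else { t })
        .filter(|t| t.chars().count() >= cfg.min_token_len)
        .filter(|t| !cfg.stop_words.contains(t))
        .collect()
}

/// Modelling (assumed here to be bag-of-words): term counts over the whole corpus.
pub fn model(docs: &[Vec<String>]) -> HashMap<String, usize> {
    let mut counts = HashMap::new();
    for doc in docs {
        for token in doc {
            *counts.entry(token.clone()).or_insert(0) += 1;
        }
    }
    counts
}

/// Analysing: the k most frequent terms, ties broken alphabetically.
pub fn top_terms(counts: &HashMap<String, usize>, k: usize) -> Vec<(String, usize)> {
    let mut terms: Vec<(String, usize)> =
        counts.iter().map(|(t, c)| (t.clone(), *c)).collect();
    terms.sort_by(|a, b| b.1.cmp(&a.1).then_with(|| a.0.cmp(&b.0)));
    terms.truncate(k);
    terms
}

/// Runs the stages in order over an in-memory corpus (loading is stubbed out).
pub fn run_pipeline(corpus: &[&str], cfg: &PipelineConfig, k: usize) -> Vec<(String, usize)> {
    let cleaned: Vec<Vec<String>> = corpus.iter().map(|d| clean(parse(d), cfg)).collect();
    top_terms(&model(&cleaned), k)
}
```

Keeping each stage as a plain function over owned or borrowed data is one possible design; a real pipeline might instead use traits or iterators so that stages can be swapped through configuration and documents can be streamed from disk rather than held in memory.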
Resources
Background
Url
Difficulty Level
High
Ethical Approval
None
Number Of Students
1
Supervisor
Pierre Le Bras
Keywords
text analysis, data mining, rust, data pipeline
Degrees
Bachelor of Science in Computer Science
Master of Science in Data Science
Master of Science in Software Engineering