Next word prediction presentation

Project presentation for the word predictor app developed for the Data Science Coursera Specialization Capstone project

Motivation

Building an algorithm for predicting the most probable word based on the typed in phrase.



Credit goes to SwiftKey for providing the data, which I could build my predictive model on.

The model

In detail

The input phrase is cleaned and calculated how many words (n) it includes.

Thereafter, the n+1-gram table is checked if there is any match between the input and the n+1-gram table without tail. If there is a match, the most frequently appearing tail is chosen as output and stored in a table along with n+1-gram table information.

Then, the first word from the input is excluded and the process continues with the reduced input until a single word becomes the input.
Thereafter, the output is chosen, which is referred to the highest n-gram table. If no prediction is found, the output is 'Sorry, nothing found'.

Application

You can find the application here, on shinyapps.

You need to enter the wished phrase and the algorithm will try to predict

And on the other side, the predicted word appears along with the other possible predicted words in a table.