Your task is to implement an automatic language identification system that can identify…

Your task is to implement an automatic language identification system that can identify 5 written languages (English, French, German, Italian and Spanish). The system takes a text as input and outputs the language identified for that text. Assume that all words in the input text are written in the same language. The system consists of two stages: Learning and Identification. Details of the system are as follows.

Stage 1: Learning languages from given text files using the n-gram technique (here n = 2, bigram). The following steps are required:

• Access the given folder named Learning and verify that it contains 5 text files: English.txt, French.txt, German.txt, Italian.txt, and Spanish.txt. These text files are in UTF-8 format.
• Do the following for each of the 5 text files:
  o Open the current text file, read its content, change all uppercase letters to lowercase ones, and remove non-alphabetic characters such as ~ ` ! @ # $ % ^ & * ( )
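The part of Stage 1 that is visible above (verify the folder, read each file, lowercase, strip non-alphabetic characters, count bigrams) could be sketched in Python roughly as follows. This is only an illustration under stated assumptions, not the assignment's reference solution: the function names are hypothetical, the bigrams are assumed to be pairs of adjacent letters within a word (the text does not say whether character or word bigrams are meant), and whitespace is assumed to be kept as a word separator. The Identification stage is not described in the text above, so it is not sketched.

```python
from collections import Counter
from pathlib import Path

# The five languages named in the task; file names follow the "<Language>.txt" pattern.
LANGUAGES = ["English", "French", "German", "Italian", "Spanish"]


def verify_learning_folder(folder):
    """Return {language: path} for the 5 expected files, or raise if any is missing."""
    folder = Path(folder)
    paths = {lang: folder / f"{lang}.txt" for lang in LANGUAGES}
    missing = [p.name for p in paths.values() if not p.is_file()]
    if missing:
        raise FileNotFoundError(f"missing in {folder}: {', '.join(missing)}")
    return paths


def normalise(text):
    """Lowercase the text and replace every non-alphabetic character with a space.

    str.isalpha() is Unicode-aware, so accented letters such as é or ü are kept.
    Assumption: spaces are kept so that words stay separated.
    """
    return "".join(ch if ch.isalpha() else " " for ch in text.lower())


def count_bigrams(text):
    """Count letter bigrams inside each word (assumed reading of 'n = 2, bigram')."""
    counts = Counter()
    for word in normalise(text).split():
        for i in range(len(word) - 1):
            counts[word[i:i + 2]] += 1
    return counts


def learn(folder="Learning"):
    """Build one bigram frequency table per language from the Learning folder."""
    paths = verify_learning_folder(folder)
    return {
        lang: count_bigrams(path.read_text(encoding="utf-8"))
        for lang, path in paths.items()
    }
```

Calling `learn()` with the Learning folder in the working directory would return a dictionary mapping each language name to a `Counter` of its bigrams. How those tables are then stored or compared depends on the remaining steps of the assignment, which are cut off above.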

Attachments:

Ass1.docx
TextFilesForL….zip

 


The post Your task is to implement an automatic language identification system that can identify… appeared first on Nursing Assignment.

 
