BE/BTech & ME/MTech Final Year Projects for Computer Science | Information Technology | ECE Engineer | IEEE Projects Topics, PhD Projects Reports, Ideas and Download | Sai Info Solution | Nashik | Pune | Mumbai
director@saiinfo | 02536644344 | +919270574718 | +919096813348 | +917447889268


SAI INFO SOLUTION


Diploma | BE | B.Tech | ME | M.Tech | PhD

Project Development and Training



Research on the Detection of Text Similarity Based on Hadoop


Abstract


Text similarity calculation is central to detecting duplicated content in science and technology project applications, academic papers, and degree theses. This work proposes a Hadoop-based method for detecting similarity between Chinese texts. The text under detection and the sample text are each segmented into words, the segmentation results are used to build a word segmentation matrix, and the detection result is obtained by scanning and analyzing that matrix. MapReduce is used to parallelize the algorithm and improve execution efficiency. Finally, an example demonstrates the effectiveness of the proposed algorithm.
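
The abstract does not spell out how the word segmentation matrix is built or scanned, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It assumes the texts are already segmented into tokens (in practice a Chinese word segmenter would produce them), that cell (i, j) of the matrix is 1 when token i of the detected text equals token j of the sample text, and that similarity is the share of detected tokens lying on a diagonal run of at least `min_run` consecutive matches. The names `segmentation_matrix`, `matched_rows`, `similarity`, `map_phase`, `reduce_phase` and the parameter `min_run` are hypothetical. The map and reduce steps are simulated in plain Python to show the shape of the parallel job; no Hadoop API is used.

```python
def segmentation_matrix(detected, sample):
    """Binary matrix: cell (i, j) is 1 when detected[i] == sample[j] (assumed layout)."""
    return [[1 if d == s else 0 for s in sample] for d in detected]


def matched_rows(matrix, min_run=2):
    """Scan each diagonal; collect row indices covered by runs of >= min_run ones."""
    rows = len(matrix)
    cols = len(matrix[0]) if rows else 0
    covered = set()
    for offset in range(-(rows - 1), cols):  # offset = j - i identifies a diagonal
        i, j = max(0, -offset), max(0, offset)
        run = []
        while i < rows and j < cols:
            if matrix[i][j]:
                run.append(i)
            else:
                if len(run) >= min_run:
                    covered.update(run)
                run = []
            i += 1
            j += 1
        if len(run) >= min_run:  # flush a run that reaches the matrix edge
            covered.update(run)
    return covered


def similarity(detected, sample, min_run=2):
    """Fraction of detected tokens that sit inside a sufficiently long matching run."""
    if not detected or not sample:
        return 0.0
    matrix = segmentation_matrix(detected, sample)
    return len(matched_rows(matrix, min_run)) / len(detected)


def map_phase(doc_id, detected, samples):
    """Mapper (simulated): emit one (doc_id, (sample_id, score)) pair per sample text."""
    for sample_id, sample in samples.items():
        yield doc_id, (sample_id, similarity(detected, sample))


def reduce_phase(pairs):
    """Reducer (simulated): keep the best-scoring sample for each detected document."""
    best = {}
    for doc_id, (sample_id, score) in pairs:
        if doc_id not in best or score > best[doc_id][1]:
            best[doc_id] = (sample_id, score)
    return best


# Illustrative, made-up token lists standing in for word segmentation results.
detected = ["text", "similarity", "detection", "based", "on", "hadoop"]
samples = {
    "s1": ["a", "text", "similarity", "detection", "method"],
    "s2": ["hadoop", "cluster", "setup"],
}
result = reduce_phase(map_phase("d1", detected, samples))
print(result)
```

Because each (detected text, sample text) comparison is independent, the mapper calls can run on separate nodes, which is the property a MapReduce parallelization relies on.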

Keywords
similarity, matrix model of word segmentation, text similarity detection, MapReduce, Hadoop



Call us: 09096813348 / 02536644344
Mail ID: developer.saiinfo@gmail.com
Skype ID: saiinfosolutionnashik