Share
Identifying semantically identical questions on, Question and Answering(Q&A) social media platforms like Quora is exceptionally significant to ensure that the quality and the quantity of content are presented to users, based on the intent of the question and thus enriching the overall user experience. Detecting duplicate questions is a challenging problem because natural language is very expressive, and a unique intent can be conveyed using different words, phrases, and sentence structuring. Machine learning and deep learning methods are known to have accomplished superior results over traditional natural language processing techniques in identifying similar texts.In our study, we explored and applied different deep learning techniques on the task of identifying duplicate questions on Quora’s question pair dataset. We applied deep learning techniques to model three different deep neural networks of multiple layers consisting of Glove embeddings, Bidirectional long-short term memory, Global Max pooling, Dense, Batch Normalization, Activation functions, and model merge.
Applied Data Science assumes a significant role in the backdrop of massive data generated from millions of devices. Currently, the daily data output is over 2.5 quintillion bytes, likely to touch 1.7 Mb of data per second per person on the planet in the future. The 12-month PG Level Advanced Certification Programme in Applied Data Science and Machine Learning enables learners to build deep tech capabilities and apply their learnings to make data-driven business decisions for their organizations. TalentSprint is offering this programme in partnership with the Robert Bosch Centre for Data Science and AI (RBCDSAI), India’s leading research hub at IIT Madras. Participants will experience a unique learning process that includes masterclass lectures, hands-on labs, hackathons, workshops, industry interactions, and a campus visit for fast-track learning.