Text mining: Using LDA to create a topic model to cluster news articles step by step
Step 1: Install and import the relevant libraries
Step 2: Load the data and convert it to a DataFrame
Step 3: Clean the text
Step 4: Make a word cloud to highlight popular words in the text
Step 5: Vectorize the text data using TF-IDF
Step 6: Build the LDA model
Step 7: Generate the topic-word distributions and the document-topic distribution
Step 8: Visualize the topics
  (1) Bar chart of the topic distribution in documents
  (2) Time series of the popularity (trend) of the topics across the months of 2016 and 2017
  (3) Elbow curves to justify the number of topics