Choose date to book a ticket

July 2024
Select a month to display
Month

A pracitcal introduction to topic modelling

Christian Göbel

How to quickly make sense of a body of text that is too large for a single human to read? This workshop provides a practical introduction to topic modeling, a form of text mining that uncovers hidden semantic structures ("topics") in a corpus of documents. Topic modelling is a form of unsupervised machine learning suitable for eliciting how prominent certain topics are in a corpus, how they are connected, and how they develop over time. With a bit of caution, researchers can also use topic modelling algorithms to classify documents and thereby make them amenable to statistical analysis.

The workshop consists of four parts. First, participants will receive a brief introduction into the use cases of topic modelling, the most commonly used algorithms, and their strengths and weaknesses. Second, participants will learn how to preprocess text for analysis (remove stop words, lemmatise words, segment Chinese language documents), select the hyperparameters of Latent Dirichlet Allocation (LDA) models and decide on an appropriate number of topics. In the third part, we will use the fitted model to classify documents, inspect a random sample of classified documents and discuss the accuracy of classification.
Finally, participants will learn how to visualise the prevalence, development and connection of topics as a bar chart, line diagram and correlation plot, respectively.

The booking period for this event is over.

Staatsbibliothek zu Berlin
Potsdamer Straße 33
Simon-Bolivar-Saal
10785 Berlin

Wed, July 10, 2024
Begin: 09:00
End: 12:30
Add to Calendar

Participant

19 currently available

free

Mon	Tue	Wed	Thu	Fri	Sat	Sun
			1	2	3	4
5	6	7	8	9	10	11
12	13	14	15	16	17	18
19	20	21	22	23	24	25
26	27	28	29	30	31

Charting the European D-SEA: Digital Scholarship in East Asian Studies

Choose date to book a ticket

A pracitcal introduction to topic modelling

Products

Uncategorized items

Participant