Topic Classification – Bridging Topic Modelling and Text Classification
People talking about topics - Credits Udemy blog

Processing human language is a broad field with many facets. One of them is determining the actual topic of a piece of written text. For example, given a text about a match between FC Barcelona and Manchester United, we may want to find out that the topic is football, even though the word football never appears in the text.

This is in fact a form of text classification and should not be confused with topic modeling. In topic modeling, we are conceptually looking for latent words in the text that best describe what the text is about. Topic modeling usually needs more evidence in the text, and we cannot tell beforehand what the outcome will be, because it depends on the text’s actual content. In practical settings, aggregating topic modeling output across many texts to find common topics is therefore not straightforward.
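To make the classification view concrete, here is a minimal sketch in Python. It is not the method from the full article: it assumes a tiny, made-up training set and uses a plain multinomial Naive Bayes classifier with add-one smoothing, chosen only because it fits in a few lines. The names TRAINING_DATA, tokenize, train and classify are hypothetical. The point is that the set of topics is fixed beforehand, and that the label football can be predicted even though the word itself never occurs in any of the texts.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical toy training data. The topic labels are fixed up front,
# which is what makes this classification rather than topic modeling.
TRAINING_DATA = [
    ("Barcelona beat Real Madrid after a late goal from the striker", "football"),
    ("Manchester United signed a new goalkeeper before the match", "football"),
    ("The referee gave a penalty in the second half of the match", "football"),
    ("The central bank raised interest rates to fight inflation", "finance"),
    ("Shares fell as investors worried about bank earnings", "finance"),
    ("The company reported strong quarterly earnings and revenue", "finance"),
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def train(examples):
    """Count words per topic and documents per topic."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    vocabulary = set()
    for text, label in examples:
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        doc_counts[label] += 1
        vocabulary.update(tokens)
    return word_counts, doc_counts, vocabulary


def classify(text, model):
    """Return the topic with the highest Naive Bayes log-probability."""
    word_counts, doc_counts, vocabulary = model
    total_docs = sum(doc_counts.values())
    tokens = [t for t in tokenize(text) if t in vocabulary]  # skip unseen words
    scores = {}
    for label in doc_counts:
        total_words = sum(word_counts[label].values())
        score = math.log(doc_counts[label] / total_docs)  # topic prior
        for token in tokens:
            # Add-one smoothing so a word unseen for this topic does not zero it out.
            count = word_counts[label][token] + 1
            score += math.log(count / (total_words + len(vocabulary)))
        scores[label] = score
    return max(scores, key=scores.get)


model = train(TRAINING_DATA)
text = "FC Barcelona and Manchester United meet in a tense match"
predicted_topic = classify(text, model)
print(predicted_topic)
```

Because the labels are chosen in advance, predictions over many texts can be aggregated simply by counting labels, which is the practical advantage over topic modeling described above.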

...

Read the full article on our blog
