
Xomnia is a word combining the letter ‘X’ – the unknown – and “Omnia” – Latin for everything. Our team of data scientists and big data engineers is trained to find the undefined – X – in all the relevant data sources – Omnia. This unknown – X – is untapped business value. Combine the X and Omnia and you get the Xomnia spirit: eager, curious and dedicated people who believe that the future is big data.

Training

  • 09:30 - 13:00
  • Xomnia HQ
  • €800

For questions or additional information, see the contact information below.

Text mining next level

So, you’ve got some text mining experience under your belt? We’re here to take you to the next level! Now that you’ve applied text classification techniques to solve well-known challenges such as sentiment analysis and topic classification, it’s time to dive into unsupervised models. Using these advanced techniques, you can discover new topics in a dataset, or identify different types of customers by analyzing email interactions.

This course will teach you how to apply unsupervised learning techniques to extract information from unlabeled text data. You will learn how to perform document clustering and topic modeling, and visualize and interpret your results. You will also learn about word vectors and apply some state-of-the-art text mining algorithms based on neural networks. At the end of this training, you should have a full arsenal of techniques to approach any text mining problem.

Tools

  • Python
  • scikit-learn
  • umap
  • bokeh

Techniques

  • Document similarity and clustering
  • Topic modeling
  • Visualizing high-dimensional data for data exploration
  • Word embeddings
  • Word2Vec
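
To give a flavor of the first technique in the list, document similarity, here is a minimal sketch in plain Python. It is an illustration only and not taken from the course material: the example sentences, the whitespace tokenizer and the function names `tfidf_vectors` and `cosine` are all assumptions made for this sketch. It weights each word by TF-IDF (using one common smoothed variant of the IDF formula) and compares documents by cosine similarity. In practice you would more likely reach for scikit-learn's `TfidfVectorizer`, which handles tokenization and normalization for you.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per document."""
    # Naive tokenizer (assumption): lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: the number of documents each term occurs in.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF, one common variant among several.
        vectors.append(
            {t: c * (math.log((1 + n_docs) / (1 + df[t])) + 1) for t, c in tf.items()}
        )
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical customer messages, made up for illustration.
docs = [
    "refund for my broken order",
    "my order arrived broken please refund",
    "how do I reset my password",
]
vecs = tfidf_vectors(docs)
sim_01 = cosine(vecs[0], vecs[1])  # two refund requests
sim_02 = cosine(vecs[0], vecs[2])  # refund request vs. password question
print(f"refund vs refund:   {sim_01:.2f}")
print(f"refund vs password: {sim_02:.2f}")
```

The two refund messages score noticeably higher than the refund and password pair, which share only the word "my". A pairwise similarity like this is the building block that document clustering works on.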

Requirements: The course assumes some experience with text preprocessing and with text classification using Python’s scikit-learn. You should also have basic knowledge of general unsupervised learning techniques.

  • Data science
  • Python
  • Scikit-learn
  • Text mining