Tapping Into the Hidden Potential of Text Data

  • Stop words removal — removing words that frequently appear in the text, but carry no information that may allow for distinguishing between documents. Examples of stop words are the, of, a.
  • Stemming — reducing inflected forms of a word into its root or stem. For example, the stem of fishing is fish
  • Part-of-speech tagging — marking each word by its part of speech, e.g. noun, verb
  • Dependency parsing — generating a syntax tree for a sentence
  • Vectorisation — mapping objects (here words) into vectors of numbers in some vector space

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Mind Foundry

Mind Foundry

Artificial Intelligence for high-stakes applications.