Štruktúra kurzu
Táto časť nie je lokalizovaná
- Introduction to natural language processing
- Chapters from computational linguistics (corpus, tokenization, morphological, syntactic and semantic analysis, entropy, perplexity)
- Text document vectorization (bag of words, one-hot encoding, TF-IDF)
- Practical taks on text classification
- Word embedding (word2vec, GloVe)
- Introduction to language modelling (n-gram models, smoothing, neural network based language models)
- Practical task on language modelling (implementation of a language detection algorithm based on language models)
- Neural network based text generator