Competencies: AI & ML > Natural Language Processing
Natural Language Processing
Body of Knowledge
| Topic | Description | Relevance | Career Tracks |
|---|---|---|---|
Text Preprocessing |
Tokenization, normalization, stemming/lemmatization, stopwords, regex for text. |
Critical |
NLP Engineer, Data Scientist |
Text Classification |
Sentiment analysis, topic classification, multi-label, imbalanced classes. |
Critical |
NLP Engineer, Data Scientist |
Named Entity Recognition |
NER models, spaCy, custom entities, BIO tagging, domain-specific NER. |
High |
NLP Engineer, Data Scientist |
Embeddings |
Word2Vec, GloVe, sentence embeddings, BERT embeddings, embedding visualization. |
Critical |
NLP Engineer, ML Engineer |
Transformers for NLP |
BERT, RoBERTa, DistilBERT, fine-tuning for downstream tasks, Hugging Face. |
Critical |
NLP Engineer, ML Engineer |
Text Generation |
Language models, beam search, sampling (temperature, top-k, top-p), repetition. |
High |
NLP Engineer, AI Engineer |
Question Answering |
Extractive QA, reading comprehension, SQuAD, open-domain QA. |
High |
NLP Engineer, AI Engineer |
Summarization |
Extractive vs abstractive, BART, T5, evaluation (ROUGE), length control. |
Medium |
NLP Engineer, AI Engineer |
Machine Translation |
Seq2seq, attention, multilingual models, mBART, translation quality. |
Medium |
NLP Engineer, ML Engineer |
spaCy/Hugging Face |
Pipeline components, custom components, Hugging Face Hub, Transformers library. |
Critical |
NLP Engineer, ML Engineer |
Personal Status
| Topic | Level | Evidence | Active Projects | Gaps |
|---|---|---|---|---|
No personal status recorded |
— |
— |
— |
— |