Competencies: AI & ML > Natural Language Processing

Natural Language Processing

Body of Knowledge

Topic Description Relevance Career Tracks

Text Preprocessing

Tokenization, normalization, stemming/lemmatization, stopwords, regex for text.

Critical

NLP Engineer, Data Scientist

Text Classification

Sentiment analysis, topic classification, multi-label, imbalanced classes.

Critical

NLP Engineer, Data Scientist

Named Entity Recognition

NER models, spaCy, custom entities, BIO tagging, domain-specific NER.

High

NLP Engineer, Data Scientist

Embeddings

Word2Vec, GloVe, sentence embeddings, BERT embeddings, embedding visualization.

Critical

NLP Engineer, ML Engineer

Transformers for NLP

BERT, RoBERTa, DistilBERT, fine-tuning for downstream tasks, Hugging Face.

Critical

NLP Engineer, ML Engineer

Text Generation

Language models, beam search, sampling (temperature, top-k, top-p), repetition.

High

NLP Engineer, AI Engineer

Question Answering

Extractive QA, reading comprehension, SQuAD, open-domain QA.

High

NLP Engineer, AI Engineer

Summarization

Extractive vs abstractive, BART, T5, evaluation (ROUGE), length control.

Medium

NLP Engineer, AI Engineer

Machine Translation

Seq2seq, attention, multilingual models, mBART, translation quality.

Medium

NLP Engineer, ML Engineer

spaCy/Hugging Face

Pipeline components, custom components, Hugging Face Hub, Transformers library.

Critical

NLP Engineer, ML Engineer

Personal Status

Topic Level Evidence Active Projects Gaps

No personal status recorded

 — 

 — 

 — 

 —