Documentation Scraper
Overview
The netapi docs command group provides a general-purpose documentation scraper with optimizations for specific sites.
Commands
ise
Scrape the Cisco ISE Admin Guide:
# All chapters
netapi docs ise --version 3.2 --chapters all
# Specific chapter
netapi docs ise --version 3.2 --chapters 2
# Markdown output
netapi docs ise --version 3.2 --format markdown --chapters 1
arch
Scrape the Arch Linux wiki:
netapi docs arch pacman --output-dir /tmp/arch-docs
netapi docs arch "System maintenance" --output-dir /tmp/arch-docs
github
Scrape GitHub repository docs:
netapi docs github pallets/flask --output-dir /tmp/flask-docs
netapi docs github pallets/flask --docs-path docs --output-dir /tmp/docs
Supported Sites
The scraper automatically picks a site-specific CSS selector for these common sites:
| Site | Selector | Notes |
|---|---|---|
| wiki.archlinux.org | | Wiki pages |
| man.archlinux.org | | Man pages |
| imslp.org | | Sheet music wiki |
| docs.python.org | | Python docs |
| readthedocs.io | | RTD projects |
| github.com | | READMEs |
| developer.cisco.com | | DevNet |
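
One way such per-site selector detection could work is a lookup keyed on the URL's hostname, with a generic fallback for unknown sites. The sketch below is illustrative only and is not taken from the netapi source: the function name pick_selector, the SITE_SELECTORS mapping, the "body" fallback, and every selector string are hypothetical placeholders, not the values netapi uses.

```python
from urllib.parse import urlparse

# Hypothetical placeholder selectors, for illustration only.
# These are NOT the selectors netapi actually uses.
SITE_SELECTORS = {
    "wiki.archlinux.org": "div.placeholder-arch-wiki",
    "man.archlinux.org": "div.placeholder-arch-man",
    "docs.python.org": "div.placeholder-python-docs",
    "readthedocs.io": "div.placeholder-rtd",
    "github.com": "div.placeholder-github-readme",
}


def pick_selector(url: str, default: str = "body") -> str:
    """Return the selector registered for the URL's site, or a fallback."""
    host = (urlparse(url).hostname or "").lower()
    for site, selector in SITE_SELECTORS.items():
        # Match the site itself or any subdomain (e.g. flask.readthedocs.io).
        if host == site or host.endswith("." + site):
            return selector
    # Unknown site: fall back to a generic selector (assumed default).
    return default


if __name__ == "__main__":
    print(pick_selector("https://wiki.archlinux.org/title/Pacman"))
    print(pick_selector("https://flask.readthedocs.io/en/latest/"))
    print(pick_selector("https://example.com/docs"))
```

Suffix matching lets a single readthedocs.io entry cover every project subdomain, while the fallback keeps the scraper usable on sites that are not in the table.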