Phase 0: Toolchain
Phase 0: Toolchain
Install, configure, and validate every tool in the forensics stack. Each tool gets a verification test and a codex entry.
Installation
All tools installed on Arch Linux via pacman unless noted.
# Core — document + image + forensics
sudo pacman -S poppler tesseract tesseract-data-eng imagemagick \
perl-image-exiftool binwalk testdisk
# Sleuth Kit (EnCase CLI replacement)
sudo pacman -S sleuthkit
# Network forensics
sudo pacman -S wireshark-cli tcpdump ngrep
# Deduplication
sudo pacman -S fdupes
# AUR
yay -S hashdeep
Installed & Verified (2026-04-19)
| Tool | Version | Status |
|---|---|---|
|
13.50 |
✅ |
|
5.5.2 |
✅ |
|
7.1.2-18 |
✅ |
|
26.03.0 |
✅ |
|
2.46 |
✅ |
|
installed |
✅ |
|
4.4 |
✅ |
|
2.4.0 |
✅ |
|
7.2 |
✅ |
|
4.14.0 |
✅ |
|
4.14.0 |
✅ |
|
4.14.0 |
✅ |
|
4.14.0 |
✅ |
|
4.14.0 |
✅ |
|
4.14.0 |
✅ |
|
4.14.0 |
✅ |
|
4.14.0 |
✅ |
|
4.14.0 |
✅ |
|
4.6.4 |
✅ |
|
4.99.6 |
✅ |
|
installed |
✅ |
Not Yet Installed (AUR / Source)
| Tool | Purpose | Install |
|---|---|---|
|
Forensic dd with hashing |
AUR |
|
E01 (EnCase format) imaging |
AUR |
|
Configurable file carving |
AUR |
|
Header-based file carving |
AUR |
|
Artifact extraction (emails, URLs) |
AUR |
|
Perceptual hash dedup |
AUR |
|
Memory forensics |
AUR / pip |
|
TCP session reconstruction |
AUR |
|
PNG steganography detection |
|
Document Tools
| Tool | Purpose | Package |
|---|---|---|
|
Extract text from PDF |
|
|
PDF metadata inspection |
|
|
Extract embedded images from PDF |
|
|
OCR — image to text |
|
|
Format conversion (docx, epub, html, markdown) |
|
|
DjVu text extraction |
|
|
Legacy .doc extraction |
|
Image & Metadata Tools
| Tool | Purpose | Package |
|---|---|---|
|
EXIF/IPTC/XMP metadata (read/write/strip) |
|
|
Image properties (dimensions, depth, format) |
|
|
Image transformation, format conversion |
|
|
Exact duplicate detection |
|
|
Perceptual hash deduplication |
|
|
PNG/BMP steganography detection |
|
|
JPEG steganography detection |
AUR / source |
Disk & Filesystem Forensics (EnCase CLI Equivalents)
| Tool | Purpose | Package |
|---|---|---|
|
Bit-for-bit disk imaging with hashing |
|
|
Create E01 (EnCase format) forensic images |
|
|
Inspect and verify E01 images |
|
|
Partition table analysis |
|
|
Filesystem metadata |
|
|
List files and directories (including deleted) |
|
|
Extract file by inode number |
|
|
Inode metadata inspection |
|
|
Timeline generation from body file |
|
|
Extract raw data blocks |
|
|
Disk image metadata |
|
|
Signature-based searching in images |
|
|
Categorize files by type in disk image |
|
|
Hash database lookup (NSRL, known-bad) |
|
|
File carving from raw disk/image |
|
|
Configurable file carving |
|
|
Header-based file carving |
|
|
Extract emails, URLs, credit cards from images |
|
Memory Forensics
| Tool | Purpose | Package |
|---|---|---|
|
Memory dump analysis (processes, network, registry) |
|
|
Extract printable strings from binary/memory |
|
|
Hex dump with ASCII |
|
|
Hex/octal/decimal dump |
|
|
Firmware and embedded file extraction |
|
Network Forensics
| Tool | Purpose | Package |
|---|---|---|
|
CLI packet analysis (Wireshark engine) |
|
|
Packet capture and filtering |
|
|
Network grep — pattern match on packets |
|
|
Reconstruct TCP sessions from pcap |
|
|
Network forensic analysis (runs on Mono) |
source / AUR |
Integrity & Hashing
| Tool | Purpose | Package |
|---|---|---|
|
Cryptographic hash verification |
|
|
BLAKE2 hashing (faster, equally secure) |
|
|
MD5 hashing (legacy, court-accepted) |
|
|
File integrity monitoring (tripwire alternative) |
|
|
Recursive hashing with audit mode |
|
Validation Test: Personal Photo (2026-04-19)
First live test: 20260403_185029.jpg — personal photo emailed from Galaxy Z Fold7.
Tools Used
-
exiftool -a -G1— full grouped EXIF/IPTC/XMP/ICC/Samsung/MPF extraction -
binwalk— embedded file signature scan -
strings -n 12— printable string extraction from binary -
identify -verbose— ImageMagick pixel-level analysis
Context Extracted
| Layer | Findings |
|---|---|
Device |
Samsung Galaxy Z Fold7, firmware |
Timestamp |
2026-04-03 18:50:29.622 PDT (UTC-07:00), sub-second precision |
Camera settings |
f/2.2, 1/60s, ISO 1000, 2.2mm (23mm equiv), 1.66x digital zoom, no flash — confirms indoor low-light |
Carrier |
MCC 310 — United States / Guam |
GPS |
Not present — either stripped by Outlook or disabled on device |
Color pipeline |
DCI-P3 D65 gamut with sRGB transfer, Samsung ICC profile (2022-07-01) |
HDR |
Dual-image MPF container — primary JPEG (1.4MB) + gain map (51KB) per ISO 21496-1 |
Image sensor |
|
Samsung internals |
Unique ID |
Pixel statistics |
12MP (4000x3000), 8-bit sRGB, mean luminance 36% — dark scene, warm color cast (red channel dominant) |
Steganography |
|
Embedded data |
512x384 thumbnail (54KB), XMP with Adobe Core 5.1.2, Google GContainer HDR metadata |
Forensic Significance
-
No GPS but carrier country is exposed — MCC 310 narrows to US even without coordinates
-
Sony IMX564 sensor ID + calibration string — ties this image to a specific sensor lot, potentially traceable to manufacturing batch
-
Device serial fragment (
0c57623f073a) — partial MAC or hardware identifier embedded by Samsung -
Firmware build (
F966U1UES8AZC1) — exact software version, checkable against CVE databases -
Sub-second timestamp — 622ms precision, useful for timeline correlation
-
Unique Image ID —
S12XSRJ00NMpersists across copies, forwards, uploads — the photo is forever linked to this device