SkillHub ClubShip Full StackFull Stack

ebook-extractor

Use when user wants to extract text from ebooks (EPUB, MOBI, PDF). Use for converting ebooks to plain text for analysis, processing, or reading. Handles all common ebook formats.

Packaged view

This page reorganizes the original catalog entry around fit, installability, and workflow context first. The original raw source lives below.

Stars

Hot score

Updated

March 20, 2026

Overall rating

C4.3

Composite score

4.3

Best-practice grade

S100.0

Install command

npx @skill-hub/cli install ratacat-claude-skills-ebook-extractor

Repository

ratacat/claude-skills

Skill path: skills/ebook-extractor

Use when user wants to extract text from ebooks (EPUB, MOBI, PDF). Use for converting ebooks to plain text for analysis, processing, or reading. Handles all common ebook formats.

Open repository

Best for

Primary workflow: Ship Full Stack.

Technical facets: Full Stack.

Target audience: everyone.

License: Unknown.

Original source

Catalog source: SkillHub Club.

Repository owner: ratacat.

This is still a mirrored public skill entry. Review the repository before installing into production workflows.

What it helps with

Install ebook-extractor into Claude Code, Codex CLI, Gemini CLI, or OpenCode workflows
Review https://github.com/ratacat/claude-skills before adding ebook-extractor to shared team environments
Use ebook-extractor for development workflows

Works across

Claude CodeCodex CLIGemini CLIOpenCode

Favorites: 0.

Sub-skills: 0.

Aggregator: No.

Original source / Raw SKILL.md

---
name: ebook-extractor
description: Use when user wants to extract text from ebooks (EPUB, MOBI, PDF). Use for converting ebooks to plain text for analysis, processing, or reading. Handles all common ebook formats.
---

# Ebook Text Extractor

## Overview
Extract plain text from EPUB, MOBI, and PDF files using Python scripts. No LLM calls - pure text extraction.

## Supported Formats

| Format | Tool Used | Notes |
|--------|-----------|-------|
| EPUB | `ebooklib` + `BeautifulSoup` | Direct parsing, preserves structure |
| MOBI | Calibre `ebook-convert` | Converts to EPUB first, then extracts |
| PDF | `PyMuPDF` (fitz) | Fast, handles most PDFs well |

## Usage

**Unified extractor (auto-detects format):**
```bash
python3 ~/.claude/skills/ebook-extractor/scripts/extract.py /path/to/book.epub
python3 ~/.claude/skills/ebook-extractor/scripts/extract.py /path/to/book.mobi
python3 ~/.claude/skills/ebook-extractor/scripts/extract.py /path/to/book.pdf
```

**Output options:**
```bash
# To stdout (default)
python3 scripts/extract.py book.epub

# To file
python3 scripts/extract.py book.epub -o output.txt
python3 scripts/extract.py book.epub > output.txt
```

**Format-specific scripts:**
```bash
python3 scripts/extract_epub.py book.epub
python3 scripts/extract_mobi.py book.mobi
python3 scripts/extract_pdf.py book.pdf
```

## Setup

```bash
# One-command setup (installs all dependencies)
~/.claude/skills/ebook-extractor/setup.sh

# Or manually:
pip install -r ~/.claude/skills/ebook-extractor/requirements.txt
brew install calibre  # macOS, for MOBI support
```

## Script Location
`~/.claude/skills/ebook-extractor/scripts/`

## Common Issues

| Problem | Solution |
|---------|----------|
| Missing package | Run `setup.sh` or `pip install -r requirements.txt` |
| MOBI fails | Ensure Calibre is installed: `brew install calibre` |
| PDF garbled | Some PDFs are image-based; OCR needed (not supported) |