56 lines
1.3 KiB
Markdown
56 lines
1.3 KiB
Markdown
# 示例
|
|
|
|
## 提取完整文档内容
|
|
|
|
```bash
|
|
# DOCX
|
|
uv run --with "markitdown[docx]" skills/lyxy-reader-office/scripts/parser.py /path/to/report.docx
|
|
|
|
# PPTX
|
|
uv run --with "markitdown[pptx]" skills/lyxy-reader-office/scripts/parser.py /path/to/slides.pptx
|
|
|
|
# XLSX
|
|
uv run --with "markitdown[xlsx]" skills/lyxy-reader-office/scripts/parser.py /path/to/data.xlsx
|
|
|
|
# PDF
|
|
uv run --with "markitdown[pdf]" --with pypdf skills/lyxy-reader-office/scripts/parser.py /path/to/doc.pdf
|
|
```
|
|
|
|
## 获取文档字数
|
|
|
|
```bash
|
|
uv run --with "markitdown[docx]" skills/lyxy-reader-office/scripts/parser.py -c /path/to/report.docx
|
|
```
|
|
|
|
## 提取所有标题
|
|
|
|
```bash
|
|
uv run --with "markitdown[docx]" skills/lyxy-reader-office/scripts/parser.py -t /path/to/report.docx
|
|
```
|
|
|
|
## 提取指定章节
|
|
|
|
```bash
|
|
uv run --with "markitdown[docx]" skills/lyxy-reader-office/scripts/parser.py -tc "第一章" /path/to/report.docx
|
|
```
|
|
|
|
## 搜索关键词
|
|
|
|
```bash
|
|
uv run --with "markitdown[docx]" skills/lyxy-reader-office/scripts/parser.py -s "关键词" -n 3 /path/to/report.docx
|
|
```
|
|
|
|
## PDF OCR 高精度解析
|
|
|
|
```bash
|
|
uv run --with docling --with pypdf skills/lyxy-reader-office/scripts/parser.py /path/to/scanned.pdf --high-res
|
|
```
|
|
|
|
## 降级到直接 Python 执行
|
|
|
|
仅当 lyxy-runner-python skill 不存在时使用:
|
|
|
|
```bash
|
|
python3 skills/lyxy-reader-office/scripts/parser.py /path/to/file.docx
|
|
```
|