1. 제품
  2.   Aspose.Words
  3.   Aspose.Words FOSS for Python

Aspose.Words Python용 FOSS

Python에서 Word 문서를 PDF, Markdown 및 텍스트로 변환하세요 — 무료이며 오픈 소스, Microsoft Office가 필요 없습니다.

워드 문서 변환을 위한 오픈소스 파이썬 라이브러리

Aspose.Words FOSS for Python은 MIT 라이선스를 가진 Python 라이브러리로, Word 문서를 로드하고 변환할 수 있습니다. DOCX, DOC, RTF, TXT, Markdown 파일을 읽고 Microsoft Office나 기타 독점 런타임 없이 PDF, Markdown, 일반 텍스트로 내보낼 수 있습니다.

이 라이브러리는 파일 로드를 위한 Document 클래스와 save() 메서드를 제공하며, SaveFormat 상수 또는 PdfSaveOptions, MarkdownSaveOptions와 같은 저장 옵션 객체를 받아 세밀한 출력 제어가 가능합니다.

pip install aspose-words-foss>=26.4.0 로 설치합니다. 이 라이브러리는 Python 3.10 이상이 필요하며 olefile, fpdf2, pydantic에 의존합니다. MIT 라이선스를 갖고 있으며 GitHub에서 완전 오픈소스로 제공됩니다.

Document Conversion

  • Multi-format input: Load documents from DOCX, DOC, RTF, TXT, and Markdown formats via the Document class.
  • PDF export: Convert any input document to PDF using SaveFormat.PDF or PdfSaveOptions.
  • Markdown export: Export to Markdown with SaveFormat.MARKDOWN or MarkdownSaveOptions.
  • Text extraction: Extract plain text content from documents using Document.get_text().
  • Plain text export: Save documents as plain text using SaveFormat.TEXT.

Where Aspose.Words FOSS Can Be Used

  • Document pipelines: Convert uploaded Word documents to PDF in backend services.
  • Content extraction: Extract text from DOCX or DOC files for indexing and search.
  • Format migration: Batch-convert legacy DOC/RTF archives to modern Markdown or PDF.
  • CI/CD automation: Generate PDF reports from Markdown or DOCX templates in build pipelines.

Save Options and Customization

  • PdfSaveOptions: Control PDF output settings when converting documents to PDF.
  • MarkdownSaveOptions: Configure Markdown-specific export options.
  • SaveFormat constants: Use SaveFormat.MARKDOWN, SaveFormat.PDF, and SaveFormat.TEXT for quick conversion.
  • Document readers: Dedicated readers for DOC, RTF, TXT, and Markdown input formats.

Developer Experience

Aspose.Words FOSS installs with pip install aspose-words-foss. Runtime dependencies (olefile, fpdf2, pydantic) are installed automatically.

The API is straightforward: load a Document from a file path, then call save() with a target path and format. For advanced control, pass a save-options object instead of a format constant. The library is MIT-licensed, open-source on GitHub, and requires Python 3.10 or later.

Convert DOCX to Markdown

Load a Word document and save it as Markdown in two lines of code.

import aspose.words_foss as aw

doc = aw.Document("input.docx")  # or .doc, .rtf, .txt, .md
doc.save("output.md", aw.SaveFormat.MARKDOWN)

Convert DOCX to PDF

Export a Word document to PDF format.

import aspose.words_foss as aw

doc = aw.Document("input.docx")
doc.save("output.pdf", aw.SaveFormat.PDF)

Extract Text from a Document

Read all text content from a Word document.

import aspose.words_foss as aw

doc = aw.Document("input.docx")
text = doc.get_text()

자주 묻는 질문

What license does Aspose.Words FOSS for Python use?

Aspose.Words FOSS for Python is released under the MIT license. You can use, modify, and distribute it in commercial and personal projects.

How do I install Aspose.Words FOSS for Python?

Install via pip with pip install aspose-words-foss>=26.4.0. Requires Python 3.10 or later.

Which document formats are supported?

The library reads DOCX, DOC, RTF, TXT, and Markdown files and exports to PDF, Markdown, and plain text.

How do I convert a DOCX file to PDF?

Load with Document("input.docx") and call doc.save("output.pdf", SaveFormat.PDF).

  

지원 및 학습 자료

 한국어