Skip to content

caelooney/python-docx2txt

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

python-docx2txt

A pure python-based utility to extract text from docx files.

The code is taken and adapted from python-docx. It can however also extract text from header, footer and hyperlinks.

How to install?

pip install docx2txt

How to run?

a. From command line:

docx2txt file.docx

b. From python:

import docx2txt
text = docx2txt.process("file.docx")

Found Bugs?

Report to ankush dot shah dot nitk at gmail dot com

About

A pure python-based utility to extract text from docx files.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 100.0%