📅  Last modified: 2022-03-11 14:46:53.503000             🧑  Author: Mango
# 1. Download and install Java
# 2. Install the Python library 'tabula-py' using pip
pip install tabula-py
# If this is the first time you are installing Java and tabula-py,
# add your Java installation folder to the PATH variable.
# If you don't, this is the error message you'll get:
tabula.errors.JavaNotFoundError: `java` command is not found from this
Python process.Please ensure Java is installed and PATH is set for `java`
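# A minimal sketch for checking the PATH setup before calling tabula.
# It only uses the standard library; the helper name find_java is
# hypothetical and is not part of tabula-py.

```python
import shutil


def find_java():
    """Return the full path of the `java` executable found on PATH, or None."""
    # shutil.which searches the directories listed in the PATH variable,
    # which is the same lookup that fails when JavaNotFoundError is raised.
    return shutil.which("java")


java_path = find_java()
if java_path is None:
    print("java was not found on PATH; add your Java installation folder to PATH")
else:
    print("java found at", java_path)
```

# If this prints a path, the `java` command is visible to the Python process.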
# 3. Import tabula and run read_pdf on the desired page of the PDF file.
import tabula
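# read_pdf returns a list of DataFrames, one per table detected on the page;
# [1] selects the second table (use [0] for the first).
df = tabula.read_pdf('data.pdf', pages=3, lattice=True)[1]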
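# Indexing the result directly fails with a bare IndexError when fewer
# tables are detected than expected. A hedged sketch of a safer lookup
# follows; pick_table and read_table are hypothetical helper names, and
# only the read_pdf call with pages and lattice comes from the step above.

```python
def pick_table(tables, index):
    """Return tables[index], with a descriptive error if it is missing."""
    if not tables:
        raise ValueError("no tables were detected on the page")
    if not 0 <= index < len(tables):
        raise IndexError(
            f"table index {index} requested, but only {len(tables)} table(s) found"
        )
    return tables[index]


def read_table(pdf_path, page, index=0):
    """Read one page with tabula and return the table at the given index."""
    # Imported here so that pick_table can be used without Java installed.
    import tabula

    tables = tabula.read_pdf(pdf_path, pages=page, lattice=True)
    return pick_table(tables, index)


# Usage, equivalent to the line above (requires Java and the PDF file):
# df = read_table('data.pdf', page=3, index=1)
```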