Extract text from ebook in python (link to O. epub)
I want to extract the content from an epub, but I don't know how to do this through a direct link to it. Can someone give me a light? I've tried with Tika-python, and at the moment I try with this ebooklib with this code: (but it can be any other solution)
import urllib.request
import ebooklib
from ebooklib import epub
myurl = "https://diegooli.s3.us-east-2.amazonaws.com/Cabana.epub"
with urllib.request.urlopen(myurl) as url:
s = url.read()
book = epub.read_epub(s)
for image in book.get_items_of_type(ebooklib.ITEM_IMAGE):
print(image)
Error:
AttributeError: 'bytes' object has no attribute 'seek'
0