Skip to content

dont break on broken pdfs #13

@puhoy

Description

@puhoy
Traceback (most recent call last):
  File "/home/meatpuppet/.local/bin/lspace", line 10, in <module>
    sys.exit(cli_group())
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/click/core.py", line 764, in __call__
    return self.main(*args, **kwargs)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/click/core.py", line 717, in main
    rv = self.invoke(ctx)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/click/core.py", line 1137, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/click/core.py", line 956, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/click/core.py", line 555, in invoke
    return callback(*args, **kwargs)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/click/decorators.py", line 17, in new_func
    return f(get_current_context(), *args, **kwargs)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/flask/cli.py", line 419, in decorator
    return __ctx.invoke(f, *args, **kwargs)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/click/core.py", line 555, in invoke
    return callback(*args, **kwargs)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/lspace/cli/import_command/__init__.py", line 14, in import_command
    import_wizard(path, skip_library_check, move)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/lspace/cli/import_command/_import.py", line 75, in import_wizard
    return import_file_wizard(path, skip_library_check, move)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/lspace/cli/import_command/_import.py", line 98, in import_file_wizard
    isbns_with_metadata = file_type_object.fetch_results()
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/lspace/file_types/_base.py", line 95, in fetch_results
    for result in f():
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/lspace/file_types/_base.py", line 58, in find_isbn_in_text
    isbns = self.get_isbns_from_text()
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/lspace/file_types/_base.py", line 122, in get_isbns_from_text
    pages = self.get_text()
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/lspace/file_types/pdf.py", line 25, in get_text
    extracted_text = page.extractText()
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/PyPDF2/pdf.py", line 2595, in extractText
    content = ContentStream(content, self.pdf)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/PyPDF2/pdf.py", line 2673, in __init__
    stream = BytesIO(b_(stream.getData()))
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/PyPDF2/generic.py", line 841, in getData
    decoded._data = filters.decodeStreamData(self)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/PyPDF2/filters.py", line 346, in decodeStreamData
    data = FlateDecode.decode(data, stream.get("/DecodeParms"))
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/PyPDF2/filters.py", line 111, in decode
    data = decompress(data)
  File "/home/meatpuppet/.local/lib/python3.7/site-packages/PyPDF2/filters.py", line 49, in decompress
    return zlib.decompress(data)
zlib.error: Error -5 while decompressing data: incomplete or truncated stream

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions