[GH-ISSUE #117] Files parsed via upload but error via SMB share crawler (JVM exceptions) #115

Closed
opened 2026-02-27 15:55:06 +03:00 by kerem · 1 comment
Owner

Originally created by @david-morton on GitHub (Jan 17, 2018).
Original GitHub issue: https://github.com/RD17/ambar/issues/117

Just installed everything today on Xenial and can successfully upload a PDF and a PNG via the GUI. OCR is performed, everything looks great.

However, the same files dropped into a SMB share which I've configured a crawler for report errors as below ... is this a known issue ? Permissions should be fine as I also mount this share and can interact with files just fine.

2018-01-17 21:25:02.668: [verbose] [p0] file content received //fluidnas/receipts/to_consume/screen shot 2018-01-18 at 9.37.21 am.png
2018-01-17 21:25:02.683: [verbose] [p0] parsing //fluidnas/receipts/to_consume/screen shot 2018-01-18 at 9.37.21 am.png
2018-01-17 21:25:02.796: [error] [p0] error parsing //fluidnas/receipts/to_consume/screen shot 2018-01-18 at 9.37.21 am.png JVM exception occurred: image/png parse error
2018-01-17 21:27:39.902: [verbose] [p0] task received f4a6a3283e2592f73725f58770899754e5309112befadb10e7c4975c81279dd1
2018-01-17 21:27:40.104: [verbose] [p0] file content received //fluidnas/receipts/to_consume/ammendments.pdf
2018-01-17 21:27:40.117: [verbose] [p0] parsing //fluidnas/receipts/to_consume/ammendments.pdf
2018-01-17 21:27:40.142: [error] [p0] error parsing //fluidnas/receipts/to_consume/ammendments.pdf JVM exception occurred: Missing root object specification in trailer.
Originally created by @david-morton on GitHub (Jan 17, 2018). Original GitHub issue: https://github.com/RD17/ambar/issues/117 Just installed everything today on Xenial and can successfully upload a PDF and a PNG via the GUI. OCR is performed, everything looks great. However, the same files dropped into a SMB share which I've configured a crawler for report errors as below ... is this a known issue ? Permissions should be fine as I also mount this share and can interact with files just fine. ``` 2018-01-17 21:25:02.668: [verbose] [p0] file content received //fluidnas/receipts/to_consume/screen shot 2018-01-18 at 9.37.21 am.png 2018-01-17 21:25:02.683: [verbose] [p0] parsing //fluidnas/receipts/to_consume/screen shot 2018-01-18 at 9.37.21 am.png 2018-01-17 21:25:02.796: [error] [p0] error parsing //fluidnas/receipts/to_consume/screen shot 2018-01-18 at 9.37.21 am.png JVM exception occurred: image/png parse error 2018-01-17 21:27:39.902: [verbose] [p0] task received f4a6a3283e2592f73725f58770899754e5309112befadb10e7c4975c81279dd1 2018-01-17 21:27:40.104: [verbose] [p0] file content received //fluidnas/receipts/to_consume/ammendments.pdf 2018-01-17 21:27:40.117: [verbose] [p0] parsing //fluidnas/receipts/to_consume/ammendments.pdf 2018-01-17 21:27:40.142: [error] [p0] error parsing //fluidnas/receipts/to_consume/ammendments.pdf JVM exception occurred: Missing root object specification in trailer. ```
kerem 2026-02-27 15:55:06 +03:00
  • closed this issue
  • added the
    bug
    label
Author
Owner

@sochix commented on GitHub (Apr 19, 2018):

@david-morton i think it's a broken files, try to open it with some external app

<!-- gh-comment-id:382659770 --> @sochix commented on GitHub (Apr 19, 2018): @david-morton i think it's a broken files, try to open it with some external app
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
starred/ambar#115
No description provided.