mirror of
https://github.com/RD17/ambar.git
synced 2026-04-25 15:35:49 +03:00
[GH-ISSUE #223] Indexing files without extensions #217
Labels
No labels
$$ Paid Support
bug
bug
enhancement
help wanted
invalid
pull-request
question
question
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
starred/ambar#217
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @tristanolive on GitHub (Mar 7, 2019).
Original GitHub issue: https://github.com/RD17/ambar/issues/223
We have a large set of files that do not have file extensions such as .txt or .png or .pdf, but this information is available in the file metadata. Ambar reports the following in the log:
path/to/file ignoring. Rule: File should have extensionIs there a configuration or other method by which to enable a crawler to index these files?
@stale[bot] commented on GitHub (Mar 22, 2019):
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
@tristanolive commented on GitHub (Mar 29, 2019):
The file command on linux provides detection of file types, which could then be used to determine what processing is necessary without relying on an extension. For example, running
file ... | awk {'print $2 "_" $3'}gives something like:Could this be a simple addition to where file type filters are currently in place? I think it would go a long way in the maturity of this product.