[GH-ISSUE #139] Error working with large files - Gemini #48
Originally created by @Bruno-Alumn on GitHub (Oct 1, 2024).
Original GitHub issue: https://github.com/jehna/humanify/issues/139
So I'm trying to de-obfuscate a file that has 3,781,204 characters. I know it's quite a bit...
I decided to go with Gemini due to the disclaimer about speed with OpenAI. Firstly, the progress is incredibly slow (probably due to the file size). Anyway, after running this command for two hours (getting to 3% completion), I get this error:
From what I understand, this is entirely Google's issue, as stated in this thread. The top answer does suggest implementing a "back-off" feature, similar to rate-limit prevention techniques. Is it possible to do this, or is there another cause related to my machine?
tysm
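The back-off idea from that answer is straightforward to sketch. Below is a minimal TypeScript example of retrying with exponential back-off; `geminiRename` is a hypothetical stand-in for whatever function issues a single Gemini request, not humanify's actual code:

```typescript
// Minimal exponential back-off wrapper (a sketch, not humanify's actual code).
// Retries a failing async call with doubling delays plus random jitter.
async function withBackoff<T>(
  fn: () => Promise<T>,
  maxRetries = 5,
  baseDelayMs = 1_000
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await fn();
    } catch (err) {
      if (attempt >= maxRetries) throw err;
      // Double the delay each attempt; jitter avoids synchronized retries.
      const delayMs = baseDelayMs * 2 ** attempt + Math.random() * 250;
      console.warn(`Attempt ${attempt + 1} failed; retrying in ${Math.round(delayMs)} ms`);
      await new Promise((resolve) => setTimeout(resolve, delayMs));
    }
  }
}

// Usage (hypothetical): const renames = await withBackoff(() => geminiRename(chunk));
```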
@0xdevalias commented on GitHub (Oct 2, 2024):
Related issues:
@Bruno-Alumn Based on a quick skim of the error + StackOverflow page, I think you're right in thinking this is more of a problem with Google + humanify not having a good 'error recovery mechanism' rather than anything specific to your machine.
@Bruno-Alumn commented on GitHub (Oct 2, 2024):
Thank you very much for the reply @0xdevalias! I'll look into contributing to the project to solve the issue, though it seems to have already been mentioned in the thread you linked. Either way, for my current situation, I decided to use OpenAI. It's true, the speed is noticeably slower, though I still had one question:

Does the following seem like an accurate ratio between input and output tokens?
@0xdevalias commented on GitHub (Oct 20, 2024):
@Bruno-Alumn I don't really know what the ratios should look like specifically, but I would definitely expect input to be MUCH higher, as that's providing all of the code being referenced, whereas output should essentially just be giving the variable renames back; so that doesn't look overly surprising to me.
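As a rough sanity check on that ratio, both sides can be tokenized locally. Here is a sketch using the `tiktoken` npm package; the file names are placeholders, not files humanify actually produces:

```typescript
import { readFileSync } from "node:fs";
import { get_encoding } from "tiktoken";

// Rough input/output token comparison (a sketch; file names are placeholders).
const enc = get_encoding("cl100k_base"); // tokenizer used by GPT-4-class models
const inputTokens = enc.encode(readFileSync("obfuscated.js", "utf8")).length;
const outputTokens = enc.encode(readFileSync("renames.json", "utf8")).length;
console.log({ inputTokens, outputTokens, ratio: inputTokens / outputTokens });
enc.free(); // the WASM-backed encoder must be freed explicitly
```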
@0xdevalias commented on GitHub (Oct 23, 2024):
@marijnbent commented on GitHub (Mar 18, 2025):
Still the same issue with Gemini (free); the paid version does work.
The problem is that the script stopped working: at 13% it got stuck (it's a big, 1.8-million-token file).
@0xdevalias commented on GitHub (Mar 18, 2025):
@marijnbent That makes sense, as I don't think any PRs have been merged that change the code to introduce rate limiting etc.
Have you tried hacking something like the workaround proposed above into your local copy?
@marijnbent When you say the script 'stopped working' / 'got stuck', do you have any more details about what kind of 'stopped' / 'stuck'? Was it a rate-limiting-type error message, or some other kind of error? It's hard to debug or propose potential solutions without more details.
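For anyone experimenting with a local workaround in the meantime, a proactive throttle (spacing requests out ahead of time, rather than retrying after a rate-limit error) is another option. A sketch under the assumption that requests are issued sequentially; the rate cap and `fn` are placeholders, not humanify's actual code:

```typescript
// Sequential throttle: keeps calls under a requests-per-minute cap (a sketch).
async function throttledMap<T, R>(
  items: T[],
  fn: (item: T) => Promise<R>,
  requestsPerMinute = 15 // placeholder; check your tier's actual limits
): Promise<R[]> {
  const minIntervalMs = 60_000 / requestsPerMinute;
  const results: R[] = [];
  for (const item of items) {
    const started = Date.now();
    results.push(await fn(item));
    // Wait out the remainder of the interval before the next request.
    const elapsed = Date.now() - started;
    if (elapsed < minIntervalMs) {
      await new Promise((resolve) => setTimeout(resolve, minIntervalMs - elapsed));
    }
  }
  return results;
}
```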