mirror of
https://github.com/asciinema/asciinema.git
synced 2026-04-26 00:15:56 +03:00
[GH-ISSUE #592] Proposal: collect dataset #927
Originally created by @elexunix on GitHub (Dec 2, 2023).
Original GitHub issue: https://github.com/asciinema/asciinema/issues/592
Hello guys!
Maybe it would be interesting or fun to collect a moderately large dataset of asciinema recordings from many different users. To contribute, just run asciinema rec to record your terminal, as long as you are not doing anything too personal there, and then share the recording. Perhaps we can gather the recordings into a dataset of casts and then, since the asciinema recording structure is luckily simple, train an LLM on that corpus and have fun watching "realistic" (in the view of that NN) casts of doing something in a terminal.
What do you think about collectively building such a dataset? I have a 4090 and can train the LLM on it :)
@ku1ik commented on GitHub (Jan 19, 2024):
How would you train the model? Are you thinking of an RNN such as a GRU, or rather a transformer model? What about the timing information: would it be part of the model as well, or were you thinking of training on the raw output only?
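To make the two options in that question concrete, here is a minimal sketch, assuming recordings are in the asciicast v2 format (a JSON header line followed by one `[time, event_type, data]` JSON array per line). It turns one cast into training text in two ways: raw output only, and output interleaved with quantized delays. The function names, the `<wait:N>` token, and the 10 ms resolution are hypothetical choices for illustration, not anything asciinema provides or anything proposed in this thread.

```python
import json


def parse_cast(text):
    """Parse asciicast v2 text into (header, events).

    Assumes the v2 layout: a JSON object on the first line, then one
    JSON array [time, event_type, data] per following line.
    """
    lines = [ln for ln in text.split("\n") if ln.strip()]
    header = json.loads(lines[0])
    events = [tuple(json.loads(ln)) for ln in lines[1:]]
    return header, events


def raw_output(events):
    """Option 1: keep only the output stream and drop all timing."""
    return "".join(data for _t, kind, data in events if kind == "o")


def timed_output(events, resolution=0.01):
    """Option 2: prefix each output chunk with a quantized delay.

    The <wait:N> token is a made-up encoding; N counts `resolution`
    second steps since the previous output event.
    """
    parts = []
    prev = 0.0
    for t, kind, data in events:
        if kind != "o":
            continue  # skip input or other event types
        ticks = round((t - prev) / resolution)
        parts.append(f"<wait:{ticks}>{data}")
        prev = t
    return "".join(parts)


# Tiny hand-written cast used only for demonstration.
SAMPLE = '''{"version": 2, "width": 80, "height": 24}
[0.25, "o", "$ "]
[1.0, "o", "ls\\r\\n"]
[1.5, "o", "file.txt\\r\\n"]
'''

if __name__ == "__main__":
    header, events = parse_cast(SAMPLE)
    print(repr(raw_output(events)))
    print(repr(timed_output(events)))
```

The raw variant is simpler to tokenize but can only be replayed at an arbitrary fixed speed. The timed variant lets a model also generate pacing, at the cost of a longer sequence and an extra vocabulary of delay tokens.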