Quote:
Originally Posted by sheldonrrr
Hey vyan,
This message indicates that your request exceeded the 60-second generation time and was aborted.
I tested two small models locally on a Mac mini M2 machine: qwen2.5:latest (you must specify qwen2.5 on the configuration page to run successfully) and qwen2.5:0.5b. There were no issues.
I recommend testing with a smaller model first. It's possible that the generation time is too long due to a mismatch between your machine and the model size.
I will fix this issue and try to increase the maximum request time. Thank you for your valuable feedback!
Hey Sheldon: Thanks for your reply.
I will try your "small model" advice. Also, could you make this "max request time" a configurable setting? I imagine hardware of different capability will warrant different values.
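To make the suggestion concrete, here is a rough sketch of what I have in mind. I have not looked at the plugin's code, so everything here is an assumption: the `max_request_time` key, the `resolve_timeout` and `post_json` names, and the 600-second upper bound are all hypothetical. Only the 60-second default comes from the message above.

```python
import json
import math
import urllib.request

DEFAULT_TIMEOUT_S = 60.0   # the current fixed limit mentioned above
MAX_TIMEOUT_S = 600.0      # hypothetical upper bound so a typo cannot hang forever


def resolve_timeout(prefs: dict) -> float:
    """Read a user-configured timeout in seconds, falling back to the default.

    'max_request_time' is a made-up settings key used for illustration only.
    """
    raw = prefs.get("max_request_time", DEFAULT_TIMEOUT_S)
    try:
        value = float(raw)
    except (TypeError, ValueError):
        # Non-numeric input: keep the existing behaviour.
        return DEFAULT_TIMEOUT_S
    if not math.isfinite(value) or value <= 0:
        # Zero, negative, NaN or infinite values are treated as "not set".
        return DEFAULT_TIMEOUT_S
    return min(value, MAX_TIMEOUT_S)


def post_json(url: str, payload: dict, prefs: dict) -> bytes:
    """Send a JSON POST request using the configured timeout."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    # urlopen's timeout argument is in seconds.
    with urllib.request.urlopen(request, timeout=resolve_timeout(prefs)) as resp:
        return resp.read()
```

With something like this, a slower machine could set the value to, say, 180 on the configuration page, while anyone who leaves it empty keeps today's 60 seconds.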
Just my $0.01