Skip to content

v91

Compare
Choose a tag to compare
@github-actions github-actions released this 03 Mar 18:36
· 21 commits to main since this release
082d5d0

This release adds the /unload that allows for manually unloading all currently loaded models.

Changelog

  • 082d5d0 Add /unload endpoint (#58) to unload all currently running models
  • 5333893 increase health check to a minimum of 5 seconds
  • af65334 Update README.md w/ starhistory graph
  • 1e25b44 add workflow_dispatch to release action
  • 0815bb4 Add windows to goreleaser #54
  • 7187cfe add Windows build support to Makefile (#54)
  • 24089d2 remove "no musa container" note from README
  • ebabe55 Delete untagged packages after build and push (#55)
  • 41a3382 deletion of untagged containers happen after build-and-push
  • 7e3353e add action step to remove untagged containers
  • 4ed58fb update container build action
  • f5a2be6 revert package src until new ggml-org has them
  • f5e6ec3 fix package src in containerfile
  • 3f462da switch package source from ggerganov to ggml-org
  • 48bd766 Update README.md
  • 8d319da improve README organization (i think...)
  • be7c502 improve docs
  • 92336f0 more container build fixes
  • ed2a50d fix bug in build-container.sh
  • 0acfdb9 update workflow to build cpu and disable musa
  • 96a8ea0 add cpu docker container build
  • f20f2c9 add docs and container build improvements #43
  • 7a97c38 enable parallel container built #46
  • 4885132 more permissions futzing
  • 8b46a0b grant package:write to container workflow #46
  • 1b6736e rename workflow for containers
  • ddc1ce0 fix container file name #46
  • 11d024b just build cuda while debugging
  • 43e23c1 add check for GITHUB_TOKEN #46
  • f9c8e76 add execute bit on build-container.sh
  • d7e1bb9 add GITHUB_TOKEN to container build env
  • ab93460 first container code (#52)