modify testing concurrency
veerbia pushed 1 commit to main • 139cf0d…a3887b4 • yesterday
update orpheus-tts-streaming
veerbia pushed 1 commit to main • 32d43f1…139cf0d • yesterday
trt-llm config A100 -> H100
move from A100 to H100 default in config.yaml
feat: add orpheus tts with streaming (#422)
Pull request merge
veerbia pushed 1 commit to main • 2827ad4…7cbfbb4 • 7 days ago
Update config.yaml to add hf-token secret
dsingal0 pushed 1 commit to main • dd4b565…2827ad4 • 8 days ago
update docs for deploying via trt-llm
veerbia pushed 1 commit to main • 5c9e9ec…f53793b • 11 days ago
veerbia pushed 1 commit to main • 52d57da…5c9e9ec • 11 days ago
bdubayah pushed 1 commit to main • 2d072a6…52d57da • 14 days ago
add refactor of example model input
add chat template deployment
add chat template deployment
dsingal0 pushed 1 commit to main • b69f4ad…9367514 • 16 days ago
rm xtts-streaming websocket changes
veerbia pushed 1 commit to main • d33f22d…b69f4ad • 17 days ago
add sesame-csm-1b example
veerbia pushed 1 commit to main • fc07ffb…d33f22d • 19 days ago
add qwen-qwq correction + add mixedbread.ai classification model