SuspiciousCarrot78@aussie.zone to Selfhosted@lemmy.worldEnglish · 2 days agoDo you host your own AI?message-squaremessage-square178fedilinkarrow-up1150arrow-down137file-text
arrow-up1113arrow-down1message-squareDo you host your own AI?SuspiciousCarrot78@aussie.zone to Selfhosted@lemmy.worldEnglish · 2 days agomessage-square178fedilinkfile-text
minus-squarefubarx@lemmy.worldlinkfedilinkEnglisharrow-up2·1 day agoFound vLLM to be the most efficient local runtime service. And “ray” as a good (but complicated) way to distribute the load: https://docs.ray.io/
Found vLLM to be the most efficient local runtime service. And “ray” as a good (but complicated) way to distribute the load: https://docs.ray.io/