标签 "推理引擎" 的搜索结果:2 个资源
专为大语言模型设计的高性能推理和服务引擎,提供高吞吐量和内存优化解决方案
High-throughput and memory-efficient inference and serving engine for Large Language Models. Deploy AI faster with state-of-the-art performance.