Pushing Open-Source LLM Inference to Its Limits

How We Achieved 5.4x Cheaper Inference than Together AI at AlpineX

Tarik Moon, Adib Hasan, Daniel Schaffield

2024-11-17

Pushing Open-Source LLM Inference to Its Limits