Engineering, /
Static Thread Mapping for High Concurrency XGBoost Inference Servers
Introduction With the proliferation of machine learning (ML) models in all aspects of business, model serving has become a significant area of focus. The rise of high-concurrency (near) real-time ML...