batch-inference

Here are 17 public repositories matching this topic...

ttanzhiqiang / onnx_tensorrt_project

Support Yolov5(4.0)/Yolov5(5.0)/YoloR/YoloX/Yolov4/Yolov3/CenterNet/CenterFace/RetinaFace/Classify/Unet. use darknet/libtorch/pytorch/mxnet to onnx to tensorrt

mxnet pytorch unet darknet classify libtorch retinaface centernet centerface yolov4 yolov5 batch-inference yolor onnx-tensorrt yolox

Updated Aug 2, 2021
C++

louisoutin / yolov5_torchserve

Star

Torchserve server using a YoloV5 model running on docker with GPU and static batch inference to perform production ready and real time inference.

docker deep-learning service pytorch object-detection torchserve batch-inference

Updated Feb 10, 2023
Python

0-mostafa-rezaee-0 / Batch_LLM_Inference_with_Ray_Data_LLM

Star

Batch LLM Inference with Ray Data LLM: From Simple to Advanced

nlp distributed-computing ray parallel-processing batch-inference large-language-models llm ray-serve ray-data vllm llm-api

Updated Apr 12, 2025
Jupyter Notebook

tungngreen / PipelineScheduler

Star

PipelineScheduler optimizes workload distribution between servers and edge devices, setting optimal batch sizes to maximize throughput and minimize latency amid content dynamics and network instability. It also addresses resource contention with spatiotemporal inference scheduling to reduce co-location interference.

model-serving batch-inference gpu-scheduling dnn-serving

Updated Sep 17, 2025
C++

ray-project / ray-saturday-dec-2022

Star

Ray Saturday Dec 2022 edition

computer-vision semantic-segmentation distributed-machine-learning batch-inference ray-distributed ray-core ray-air

Updated Feb 14, 2023
Jupyter Notebook

milenkovicm / torchfusion

Star

Torchfusion is a very opinionated torch inference on datafusion.

rust machine-learning sql inference torch pytorch datafusion userdefined-functions batch-inference

Updated Apr 24, 2025
Rust

SABER-labs / torch_batcher

Star

Serve pytorch inference requests using batching with redis for faster performance.

redis gpu pytorch inference-server batch-inference

Updated Apr 11, 2021
Python

yuwenmichael / Grounding-DINO-Batch-Inference

Star

Support batch inference of Grounding DINO. "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

pytorch object-detection batch-inference groundingdino

Updated Oct 2, 2023
Jupyter Notebook

sutro-sh / sutro

Star

Analyze and generate unstructured data using LLMs, from quick experiments to billion token jobs.

csv s3 pandas data-engineering parquet data-processing data-pipelines observability synthetic-data unstructured-data mlops batch-inference polars llm-inference evals distributed-inference

Updated Oct 2, 2025
Python

milenkovicm / lightfusion

Star

LightGBM Inference on Datafusion

rust machine-learning sql inference lightgbm udf datafusion userdefined-functions batch-inference

Updated Aug 13, 2025
Rust

mili-tan / Onllama.OllamaBatch

Star

简单的 Ollama JSONL 批量推理工具 / Simple Ollama JSONL batch inference tool.

batch batch-processing batch-inference ollama ollama-interface ollama-client ollama-api

Updated Jul 20, 2025
C#

kyoro1 / image_analysis_with_automl_in_azure

Star

This repository provides sample codes, which enable you to learn how to use auto-ml image classification, or object detection under Azure ML(AML) environment.

image-classification object-detection automl azure-machine-learning managed-identity batch-inference azure-machine-learning-pipeline

Updated Aug 4, 2022
Jupyter Notebook

brnaguiar / mlops-next-watch

Star

MLOps project that recommends movies to watch implementing Data Engineering and MLOps best practices.

airflow spark aws-s3 grafana postgresql prometheus artificial-intelligence data-engineering minio recommender-system movie-recommendation dvc mlops mlflow batch-inference batch-scoring

Updated Jul 15, 2024
Jupyter Notebook

ARYAN555279 / Batch_LLM_Inference_with_Ray_Data_LLM

Star

Batch LLM Inference with Ray Data LLM: From Simple to Advanced

nlp distributed-computing ray parallel-processing batch-inference large-language-models llm ray-serve ray-data vllm llm-api

Updated Oct 3, 2025
Dockerfile

rohanchauhan / azure-batch-inference-service

Star

We perform batch inference on lead scoring task using Pyspark.

machine-learning azure pyspark lead-scoring batch-inference

Updated Jul 11, 2021
Jupyter Notebook

sarabesh / sentiment-analysis-finetuning-and-deployment

Star

This repo simulates how an ML model moves to production in an industry setting. The goal is to build, deploy, monitor, and retrain a sentiment analysis model using Kubernetes (minikube) and FastAPI.

kubernetes sentiment-analysis finetuning mlops api-interface fast-api bert-fine-tuning batch-inference

Updated Apr 11, 2025
Jupyter Notebook

kimmmmyy223 / llm-batch

Star

🚀 Process JSON data in batches with `llm-batch`, leveraging sequential or parallel modes for efficient interaction with LLMs.

react python nlp flask aws ops deep-learning rabbitmq distributed-computing bedrock language-model batch-inference dynamic-batching large-language-models ray-data vllm llm-agent llm-inference

Updated Oct 3, 2025
Go

Improve this page

Add a description, image, and links to the batch-inference topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the batch-inference topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

batch-inference

Here are 17 public repositories matching this topic...

ttanzhiqiang / onnx_tensorrt_project

louisoutin / yolov5_torchserve

0-mostafa-rezaee-0 / Batch_LLM_Inference_with_Ray_Data_LLM

tungngreen / PipelineScheduler

ray-project / ray-saturday-dec-2022

milenkovicm / torchfusion

SABER-labs / torch_batcher

yuwenmichael / Grounding-DINO-Batch-Inference

sutro-sh / sutro

milenkovicm / lightfusion

mili-tan / Onllama.OllamaBatch

kyoro1 / image_analysis_with_automl_in_azure

brnaguiar / mlops-next-watch

ARYAN555279 / Batch_LLM_Inference_with_Ray_Data_LLM

rohanchauhan / azure-batch-inference-service

sarabesh / sentiment-analysis-finetuning-and-deployment

kimmmmyy223 / llm-batch

Improve this page

Add this topic to your repo