PyTorch native quantization and sparsity for training and inference
-
Updated
Sep 19, 2024 - Python
PyTorch native quantization and sparsity for training and inference
A lightweight framework that enables serverless users to reduce their bills by harvesting non-serverless compute resources such as their VMs, on-premise servers, or personal computers.
Running large language models on a single GPU for throughput-oriented scenarios.
A system allowing for the automatic offloading of contents of an SD card.
Offload heavy processing from your requests to separate scope. Simple configuration and usage.
Multi Criteria Decision Making for automotive cloud/edge application models
Run Mixtral-8x7B models in Colab or consumer desktops
This is the main repository for REU application on drone offloading and more
A collection of tests for the Open vSwitch HW offload.
A collection of tests for the Open vSwitch HW offload.
Offloading Resource-Intensive Tasks to Raspberry Pi (or IoT Devices) Using SSH
DPU-Powered File System Virtualization over virtio-fs
Backend.AI Client Library for Python
Supporting page for the manuscript titled "P4toNFV: Offloading from P4 Switches to NFV in Programmable Data Planes."
Code for paper "Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI" (MobiCom'22)
A Wordpress plugin to offload all attachments to a GitHub repo.
基于 DPDK 和智能网卡的流量卸载试验. A flow offloading prototype base on DPDK and Mellanox/Nvidia SmartNIC.
OpenMP Matrix Multiplication Offloading Playground
Java implementation of Home Edge project with sub set of features for Android support
Add a description, image, and links to the offloading topic page so that developers can more easily learn about it.
To associate your repository with the offloading topic, visit your repo's landing page and select "manage topics."