-
Notifications
You must be signed in to change notification settings - Fork 145
Issues: aws-neuron/aws-neuron-sdk
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Number of instructions (8658944) is over the threshold (5000000)
#987
opened Sep 18, 2024 by
hasadata
Conversion of YOLOv8x torchscript to torch_neuron failing
#983
opened Sep 17, 2024 by
Harish-Sundaravel
[Pytorch] Inf1, neuron-cc stuck and memory keep incresing to 100G
#978
opened Sep 11, 2024 by
PigletOS
[PyTorch] cxx11 ABI support (torch-neuronx)
inference
pytorch
torch-neuronx
training
#975
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] Top-p sampling on device
inference
pytorch
transformers-neuronx
transformers-neuronx decoder-only LLM inference
#974
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] Diffusion-Transformer (image generation) for inference
inference
NxD
pytorch
#973
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] FP8 KV Cache (weights only)
inference
pytorch
transformers-neuronx
transformers-neuronx decoder-only LLM inference
#972
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] INT8 for inference (weights only)
inference
pytorch
transformers-neuronx
transformers-neuronx decoder-only LLM inference
#971
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] LLaVA model inference
inference
models
NxD
pytorch
#970
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] Multi-LoRA inference
inference
NxDT
torch-neuronx
#969
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] Paged attention in vLLM
inference
pytorch
transformers-neuronx
transformers-neuronx decoder-only LLM inference
#968
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] Combined Continuous Batching and Speculative Decoding
inference
pytorch
torch-neuronx
#967
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] Conformer (WeNet BestRQ) model inference
inference
models
pytorch
torch-neuronx
#966
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] INT8 compute for inference
inference
pytorch
transformers-neuronx
transformers-neuronx decoder-only LLM inference
#964
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] DBRX model inference with Continuous batching
inference
models
NxDT
pytorch
#963
opened Aug 23, 2024 by
eshalakhotia
[PyTorch] Video-LLaVA model training
models
NxD
pytorch
training
#958
opened Aug 23, 2024 by
eshalakhotia
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.