ing

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need…

Stars 8,484

llmpeftfinetuning

研究学习 / 检索整理

optimizing-attention-flash

uioptimizingattentionflash

Optimizes transformer attention with Flash Attention for 2-4x speedup and 10-20x memory reduction. Use when training/running transformers with long sequences…

Stars 8,483

研究学习 / 检索整理

skypilot-multi-cloud-orchestration

Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage…

Stars 8,489

uiragazureskypilot

研究学习 / 检索整理

nanogpt

designuinanogpteducational

Educational GPT implementation in ~300 lines. Reproduces GPT-2 (124M) on OpenWebText. Clean, hackable code for learning transformers. By Andrej Karpathy.…

Stars 8,482

研究学习 / 检索整理

clip

OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M…

Stars 8,472

githubclipopenaimodel

研究学习 / 检索整理

pytorch-fsdp2

Adds PyTorch FSDP2 (fully_shard) to training scripts with correct init, sharding, mixed precision/offload config, and distributed checkpointing. Use when…

Stars 8,486

apiagentpytorchfsdp2

研究学习 / 检索整理

gptq

Post-training 4-bit quantization for LLMs with minimal accuracy loss. Use for deploying large models (70B, 405B) on consumer GPUs, when you need 4× memory…

Stars 8,475

llmgptqposttraining

研究学习 / 检索整理

lambda-labs-gpu-cloud

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent…

Stars 8,478

uiperformanceraglambda

研究学习 / 检索整理

pinecone

Managed vector database for production AI applications. Fully managed, auto-scaling, with hybrid search (dense + sparse), metadata filtering, and namespaces.…

Stars 8,485

uidatabaseragpinecone

研究学习 / 检索整理

blip-2-vision-language

Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text…

Stars 8,471

uiragllmblip

研究学习 / 检索整理

phoenix-observability

Open-source AI observability platform for LLM tracing, evaluation, and monitoring. Use when debugging LLM applications with detailed traces, running…

Stars 8,485

uillmpromptdebugging

研究学习 / 检索整理

segment-anything-model

Foundation model for image segmentation with zero-shot transfer. Use when you need to segment any object in images using points, boxes, or masks as prompts, or…

Stars 8,487

uipromptsegmentanything

研究学习 / 检索整理

nnsight-remote-interpretability

Provides guidance for interpreting and manipulating neural network internals using nnsight with optional NDIF remote execution. Use when needing to run…

Stars 8,482

uigithubnnsightremote

研究学习 / 检索整理

model-pruning

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50%…

Stars 8,482

llmmodelpruningreduce

研究学习 / 检索整理

gitnexus-pr-review