All Capabilities

ui, optimizing, attention, flash

Optimizes transformer attention with Flash Attention for 2-4x speedup and 10-20x memory reduction. Use when training/running transformers with long sequences…

Stars 8,483

Research & Learning / Retrieval & Organization

skypilot-multi-cloud-orchestration

Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage…

Stars 8,489

ui, rag, azure, skypilot

Research & Learning / Retrieval & Organization

nanogpt

design, ui, nanogpt, educational

Educational GPT implementation in ~300 lines. Reproduces GPT-2 (124M) on OpenWebText. Clean, hackable code for learning transformers. By Andrej Karpathy.…

Stars 8,482

Research & Learning / Retrieval & Organization

peft-fine-tuning

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need…

Stars 8,484

llm, peft, finetuning

Research & Learning / Retrieval & Organization

clip

OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M…

Stars 8,472

github, clip, openai, model

Research & Learning / Retrieval & Organization

mamba-architecture

State-space model with O(n) complexity vs Transformers' O(n²). 5× faster inference, million-token sequences, no KV cache. Selective SSM with hardware-aware…

Stars 8,478

ui, ux, mamba, architecture

Research & Learning / Retrieval & Organization

pinecone

Managed vector database for production AI applications. Fully managed, auto-scaling, with hybrid search (dense + sparse), metadata filtering, and namespaces.…

Stars 8,485

ui, database, rag, pinecone

Research & Learning / Retrieval & Organization

phoenix-observability

Open-source AI observability platform for LLM tracing, evaluation, and monitoring. Use when debugging LLM applications with detailed traces, running…

Stars 8,485

ui, llm, prompt, debugging

Research & Learning / Retrieval & Organization

blip-2-vision-language

Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text…

Stars 8,471

ui, rag, llm, blip

Research & Learning / Retrieval & Organization

lambda-labs-gpu-cloud

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent…

Stars 8,478

ui, performance, rag, lambda

Research & Learning / Retrieval & Organization

rwkv-architecture

RNN+Transformer hybrid with O(n) inference. Linear time, infinite context, no KV cache. Train like GPT (parallel), infer like RNN (sequential). Linux…

Stars 8,487

ui, ux, rwkv, architecture

Research & Learning / Retrieval & Organization

model-pruning

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50%…

Stars 8,482

llm, model, pruning, reduce

Research & Learning / Retrieval & Organization

segment-anything-model

Foundation model for image segmentation with zero-shot transfer. Use when you need to segment any object in images using points, boxes, or masks as prompts, or…

Stars 8,487

ui, prompt, segment, anything

Research & Learning / Retrieval & Organization

nnsight-remote-interpretability