灵感菇

AI 技能的自然生态,你的一句话,蔓延出无限连接。

搜索结果

全部能力

找到 1009 个相关结果 / 研究资料

研究学习 / 检索整理

fine-tuning-with-trl

fine-tuning-with-trl

199

Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward…

Stars 8,475
uillmfinetuning

研究学习 / 检索整理

audiocraft-audio-generation

audiocraft-audio-generation

199

PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text…

Stars 8,471
uiaudiocraftaudiogeneration

研究学习 / 检索整理

clip

clip

198

OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M…

Stars 8,472
githubclipopenaimodel

研究学习 / 检索整理

gptq

gptq

198

Post-training 4-bit quantization for LLMs with minimal accuracy loss. Use for deploying large models (70B, 405B) on consumer GPUs, when you need 4× memory…

Stars 8,475
llmgptqposttraining

研究学习 / 检索整理

optimizing-attention-flash

optimizing-attention-flash

198

Optimizes transformer attention with Flash Attention for 2-4x speedup and 10-20x memory reduction. Use when training/running transformers with long sequences…

Stars 8,483
uioptimizingattentionflash

研究学习 / 检索整理

nanogpt

nanogpt

198

Educational GPT implementation in ~300 lines. Reproduces GPT-2 (124M) on OpenWebText. Clean, hackable code for learning transformers. By Andrej Karpathy.…

Stars 8,482
designuinanogpteducational

研究学习 / 检索整理

skypilot-multi-cloud-orchestration

skypilot-multi-cloud-orchestration

198

Multi-cloud orchestration for ML workloads with automatic cost optimization. Use when you need to run training or batch jobs across multiple clouds, leverage…

Stars 8,489
uiragazureskypilot

研究学习 / 检索整理

peft-fine-tuning

peft-fine-tuning

198

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need…

Stars 8,484
llmpeftfinetuning

研究学习 / 检索整理

pytorch-fsdp2

pytorch-fsdp2

198

Adds PyTorch FSDP2 (fully_shard) to training scripts with correct init, sharding, mixed precision/offload config, and distributed checkpointing. Use when…

Stars 8,486
apiagentpytorchfsdp2

研究学习 / 检索整理

pinecone

pinecone

197

Managed vector database for production AI applications. Fully managed, auto-scaling, with hybrid search (dense + sparse), metadata filtering, and namespaces.…

Stars 8,485
uidatabaseragpinecone

研究学习 / 检索整理

mamba-architecture

mamba-architecture

197

State-space model with O(n) complexity vs Transformers' O(n²). 5× faster inference, million-token sequences, no KV cache. Selective SSM with hardware-aware…

Stars 8,478
uiuxmambaarchitecture

研究学习 / 检索整理

phoenix-observability

phoenix-observability

197

Open-source AI observability platform for LLM tracing, evaluation, and monitoring. Use when debugging LLM applications with detailed traces, running…

Stars 8,485
uillmpromptdebugging

研究学习 / 检索整理

blip-2-vision-language

blip-2-vision-language

197

Vision-language pre-training framework bridging frozen image encoders and LLMs. Use when you need image captioning, visual question answering, image-text…

Stars 8,471
uiragllmblip

研究学习 / 检索整理

lambda-labs-gpu-cloud

lambda-labs-gpu-cloud

197

Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with simple SSH access, persistent…

Stars 8,478
uiperformanceraglambda

研究学习 / 检索整理

model-pruning

model-pruning

196

Reduce LLM size and accelerate inference using pruning techniques like Wanda and SparseGPT. Use when compressing models without retraining, achieving 50%…

Stars 8,482
llmmodelpruningreduce

研究学习 / 检索整理

nnsight-remote-interpretability

nnsight-remote-interpretability

196

Provides guidance for interpreting and manipulating neural network internals using nnsight with optional NDIF remote execution. Use when needing to run…

Stars 8,482
uigithubnnsightremote

研究学习 / 检索整理

segment-anything-model

segment-anything-model

196

Foundation model for image segmentation with zero-shot transfer. Use when you need to segment any object in images using points, boxes, or masks as prompts, or…

Stars 8,487
uipromptsegmentanything

研究学习 / 检索整理

rwkv-architecture

rwkv-architecture

196

RNN+Transformer hybrid with O(n) inference. Linear time, infinite context, no KV cache. Train like GPT (parallel), infer like RNN (sequential). Linux…

Stars 8,487
uiuxrwkvarchitecture

研究学习 / 检索整理

evaluating-code-models

evaluating-code-models

195

Evaluates code generation models across HumanEval, MBPP, MultiPL-E, and 15+ benchmarks with pass@k metrics. Use when benchmarking code models, comparing coding…

Stars 8,475
uigithubevaluatingmodels

研究学习 / 检索整理

ln-613-code-comments-auditor

ln-613-code-comments-auditor

194

Checks inline code documentation quality: WHY-not-WHAT, density, forbidden content, docstrings quality, actuality, legacy cleanup. Use when auditing comments…

Stars 465
designaudit613comments

33 / 51