搜索结果
全部能力
找到 1009 个相关结果 / 研究资料
研究学习 / 检索整理
pyvene-interventions
pyvene-interventions
Provides guidance for performing causal interventions on PyTorch models using pyvene's declarative intervention framework. Use when conducting causal tracing,…
研究学习 / 检索整理
adk-docs
adk-docs
创建、审阅、更新和搜索 ADK 文档的指南 - 当用户询问编写、维护或审计 ADK 机器人文档时使用
研究学习 / 检索整理
evaluating-llms-harness
evaluating-llms-harness
Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting…
研究学习 / 检索整理
instructor
instructor
Extract structured data from LLM responses with Pydantic validation, retry failed extractions automatically, parse complex JSON with type safety, and stream…
研究学习 / 检索整理
sparse-autoencoder-training
sparse-autoencoder-training
Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use…
研究学习 / 检索整理
gguf-quantization
gguf-quantization
GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible…
研究学习 / 检索整理
awq-quantization
awq-quantization
Activation-aware weight quantization for 4-bit LLM compression with 3x speedup and minimal accuracy loss. Use when deploying large models (7B-70B) on limited…
研究学习 / 检索整理
constitutional-ai
constitutional-ai
Anthropic's method for training harmless AI through self-improvement. Two-phase approach - supervised learning with self-critique/revision, then RLAIF (RL from…
研究学习 / 检索整理
pytorch-lightning
pytorch-lightning
High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from…
研究学习 / 检索整理
AI 研究复现
ai-research-reproduction
README 优先的 AI 仓库复现主编排器。当用户需要端到端、最小可信的复现流程,且该流程会读取仓库…
研究学习 / 检索整理
nsfc-research-content-writer
nsfc-research-content-writer
当用户明确要求"写/改研究内容""研究内容+创新+年度计划编排"时使用。为 NSFC 正文"(二)研究内容"写作/重构,并同步编排"特色与创新"和"三年年度研究计划",输出可直接落到 LaTeX 模板的三个 extraTex 文件。
研究学习 / 检索整理
guidance
guidance
Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance…
研究学习 / 检索整理
outlines
outlines
Guarantee valid JSON/XML/code structure during generation, use Pydantic models for type-safe outputs, support local models (Transformers, vLLM), and maximize…
研究学习 / 检索整理
slime-rl-training
slime-rl-training
Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models, implementing custom data generation…
研究学习 / 检索整理
sentencepiece
sentencepiece
Language-independent tokenizer treating text as raw Unicode. Supports BPE and Unigram algorithms. Fast (50k sentences/sec), lightweight (6MB memory),…
研究学习 / 检索整理
distributed-llm-pretraining-torchtitan
distributed-llm-pretraining-torchtitan
Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use when pretraining Llama 3.1, DeepSeek V3, or…
研究学习 / 检索整理
moe-training
moe-training
Train Mixture of Experts (MoE) models using DeepSpeed or HuggingFace. Use when training large-scale models with limited compute (5× cost reduction vs dense…
研究学习 / 检索整理
paper-self-review
paper-self-review
This skill should be used when the user asks to "review paper quality", "check paper completeness", "validate paper structure", "self-review before…
研究学习 / 检索整理
long-context
long-context
Extend context windows of transformer models using RoPE, YaRN, ALiBi, and position interpolation techniques. Use when processing long documents (32k-128k+…
研究学习 / 检索整理
simpo-training
simpo-training
Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4 points on AlpacaEval 2.0). No reference model…