全部软件工程通用助手设计与产品研究学习安全与治理自动化与集成内容创作数据与分析办公运营

搜索结果

全部能力

找到 1247 个相关结果 / 研究学习

研究学习 / 检索整理

memory-protocol

Persistent cross-session memory using Memento MCP knowledge graph (mcp__memento__* tools). Recall-before-acting: search memory before starting tasks, on errors, and when receiving corrections. Multi-dimensional search: two queries per recall event (technical topic + process/workflow learnings). Store-after-discovery: persist solutions, conventions, and corrections immediately. Three-step recall: search, open_nodes, traverse relations. WORKING_STATE.md for crash recovery. Self-reminder protocol every 5-10 messages. Activate on task start, errors, corrections, session boundaries, or explicit memory requests. See references/agents-md-setup.md for AGENTS.md integration.

Stars 2

agentagentsworkflowdebugging

研究学习 / 检索整理

openrlhf-training

201

High-performance RLHF framework with Ray+vLLM acceleration. Use for PPO, GRPO, RLOO, DPO training of large models (7B-70B+). Built on Ray, vLLM, ZeRO-3. 2×…

Stars 8,482

uiperformancedockerllm

研究学习 / 检索整理

speculative-decoding

201

Accelerate LLM inference using speculative decoding, Medusa multiple heads, and lookahead decoding techniques. Use when optimizing inference speed (1.5-3.6×…

Stars 8,489

llmspeculativedecodingaccelerate

研究学习 / 检索整理

grpo-rl-training

201

Expert guidance for GRPO/RL fine-tuning with TRL for reasoning and task-specific model training

Stars 8,475

uiworkflowgrpotraining

研究学习 / 检索整理

guidance

200

Control LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance…

Stars 8,475

uillmpromptworkflow

研究学习 / 检索整理

nsfc-research-content-writer

200

当用户明确要求"写/改研究内容""研究内容+创新+年度计划编排"时使用。为 NSFC 正文"（二）研究内容"写作/重构，并同步编排"特色与创新"和"三年年度研究计划"，输出可直接落到 LaTeX 模板的三个 extraTex 文件。

Stars 1,580

nsfcresearchcontentwriter

研究学习 / 检索整理

gguf-quantization

200

GGUF format and llama.cpp quantization for efficient CPU/GPU inference. Use when deploying models on consumer hardware, Apple Silicon, or when needing flexible…

Stars 8,475

uiggufquantizationformat

研究学习 / 检索整理

evaluating-llms-harness

200

Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting…

Stars 8,475

uillmpromptevaluating

研究学习 / 检索整理

awq-quantization

200

Activation-aware weight quantization for 4-bit LLM compression with 3x speedup and minimal accuracy loss. Use when deploying large models (7B-70B) on limited…

Stars 8,471

backendllmawqquantization

研究学习 / 检索整理

AI 研究复现

ai-research-reproduction

200

README 优先的 AI 仓库复现主编排器。当用户需要端到端、最小可信的复现流程，且该流程会读取仓库…

Stars 103

uiauditagentresearch

研究学习 / 检索整理

instructor

200

Extract structured data from LLM responses with Pydantic validation, retry failed extractions automatically, parse complex JSON with type safety, and stream…

Stars 8,476

apillmgithubinstructor

研究学习 / 检索整理

outlines

200

Guarantee valid JSON/XML/code structure during generation, use Pydantic models for type-safe outputs, support local models (Transformers, vLLM), and maximize…

Stars 8,483

llmgithuboutlinesguarantee

研究学习 / 检索整理

constitutional-ai

200

Anthropic's method for training harmless AI through self-improvement. Two-phase approach - supervised learning with self-critique/revision, then RLAIF (RL from…

Stars 8,473

uiconstitutionalanthropicmethod

研究学习 / 检索整理

adk-docs

200

创建、审阅、更新和搜索 ADK 文档的指南 - 当用户询问编写、维护或审计 ADK 机器人文档时使用

Stars 0

uiauditworkflowadk

研究学习 / 检索整理

pytorch-lightning

200

High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system, and minimal boilerplate. Scales from…

Stars 8,486

uipytorchlightninglevel

研究学习 / 检索整理

sparse-autoencoder-training

200

Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features. Use…

Stars 8,489

uigithubsparseautoencoder

研究学习 / 检索整理

long-context

199

Extend context windows of transformer models using RoPE, YaRN, ALiBi, and position interpolation techniques. Use when processing long documents (32k-128k+…

Stars 8,478

longcontextextendwindows

研究学习 / 检索整理

distributed-llm-pretraining-torchtitan

199

Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use when pretraining Llama 3.1, DeepSeek V3, or…

Stars 8,474

uillmgithubdistributed

研究学习 / 检索整理

moe-training

199

Train Mixture of Experts (MoE) models using DeepSpeed or HuggingFace. Use when training large-scale models with limited compute (5× cost reduction vs dense…

Stars 8,482

performancemoetrainingtrain

研究学习 / 检索整理

fine-tuning-with-trl

199

Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward…

Stars 8,475

uillmfinetuning

第 40 / 63 页