搜索结果
全部能力
找到 92 个相关结果 / 测试与 QA
研究学习 / 检索整理
ln-520-test-planner
ln-520-test-planner
Orchestrates test planning pipeline: research, manual testing, automated test planning. Use when Story needs comprehensive test coverage planning.
研究学习 / 检索整理
ln-510-quality-coordinator
ln-510-quality-coordinator
Use when coordinating story quality evaluation with mandatory research, worker summaries, agent review, regression evidence, and bounded refinement.
研究学习 / 检索整理
ln-521-test-researcher
ln-521-test-researcher
Researches real-world problems, competitor solutions, and customer complaints for a feature domain. Use before test planning to ground tests in actual user…
研究学习 / 检索整理
paper-review
paper-review
Guides self-review of YOUR OWN academic paper before submission with adversarial stress-testing. Core method: 5-aspect checklist (contribution sufficiency,…
研究学习 / 检索整理
statsmodels
statsmodels
Statistical models library for Python. Use when you need specific model classes (OLS, GLM, mixed models, ARIMA) with detailed diagnostics, residuals, and…
研究学习 / 检索整理
hypothesis-generation
hypothesis-generation
Structured hypothesis formulation from observations. Use when you have experimental observations or data and need to formulate testable hypotheses with…
研究学习 / 检索整理
networkx
networkx
Comprehensive toolkit for creating, analyzing, and visualizing complex networks and graphs in Python. Use when working with network/graph data structures,…
研究学习 / 检索整理
dingtalk-todo
dingtalk-todo
钉钉待办管理。当用户提到"钉钉待办"、"待办任务"、"创建待办"、"新增待办"、"查看待办"、"完成待办"、"标记完成"、"删除待办"、"待办列表"、"我的待办"、"设置截止时间"、"指派待办"、"dingtalk todo"、"todo task"、"task…
研究学习 / 检索整理
alicloud-platform-multicloud-docs-api-benchmark
alicloud-platform-multicloud-docs-api-benchmark
对标阿里云、AWS、Azure、GCP、腾讯云、火山引擎和华为云的同类产品文档及 API 文档。给定一个…
研究学习 / 检索整理
pymoo
pymoo
Multi-objective optimization framework. NSGA-II, NSGA-III, MOEA/D, Pareto fronts, constraint handling, benchmarks (ZDT, DTLZ), for engineering design and…
研究学习 / 检索整理
pydeseq2
pydeseq2
Differential gene expression analysis (Python DESeq2). Identify DE genes from bulk RNA-seq counts, Wald tests, FDR correction, volcano/MA plots, for RNA-seq…
研究学习 / 检索整理
math-reasoning
math-reasoning
Formal mathematical reasoning for research papers — derive equations, write proofs, formalize problem settings, select statistical tests, and generate LaTeX…
研究学习 / 检索整理
axiom-apple-docs-research
axiom-apple-docs-research
Use when researching Apple frameworks, APIs, or WWDC sessions - provides techniques for retrieving full transcripts, code samples, and documentation using…
研究学习 / 检索整理
k6-docs
k6-docs
Use when writing or reviewing k6 documentation across TypeScript types, user docs, and release notes.
研究学习 / 检索整理
evaluating-llms-harness
evaluating-llms-harness
Evaluates LLMs across 60+ academic benchmarks (MMLU, HumanEval, GSM8K, TruthfulQA, HellaSwag). Use when benchmarking model quality, comparing models, reporting…
研究学习 / 检索整理
gitnexus-pr-review
gitnexus-pr-review
Use when the user wants to review a pull request, understand what a PR changes, assess risk of merging, or check for missing test coverage. Examples: \"Review…
研究学习 / 检索整理
evaluating-code-models
evaluating-code-models
Evaluates code generation models across HumanEval, MBPP, MultiPL-E, and 15+ benchmarks with pass@k metrics. Use when benchmarking code models, comparing coding…
研究学习 / 检索整理
kimi-pdf
kimi-pdf
Professional PDF solution. Create PDFs using HTML+Paged.js (academic papers, reports, documents). Process existing PDFs using Python (read, extract, merge,…
研究学习 / 检索整理
dingtalk-contact
dingtalk-contact
钉钉通讯录与联系人查询。当用户提到"钉钉通讯录"、"查找员工"、"搜索用户"、"查用户信息"、"获取用户详情"、"用户手机号"、"员工姓名"、"员工工号"、"查部门"、"搜索部门"、"部门成员"、"部门列表"、"部门详情"、"子部门"、"父部门"、"部门路径"、"员工总数"、"通讯录搜索"、"userId 转…
研究学习 / 检索整理
memory-systems
memory-systems
Guides implementation of agent memory systems, compares production frameworks (Mem0, Zep/Graphiti, Letta, LangMem, Cognee), and designs persistence architectures for cross-session knowledge retention. Use when the user asks to "implement agent memory", "persist state across sessions", "build knowledge graph for agents", "track entities over time", "add long-term memory", "choose a memory framework", or mentions temporal knowledge graphs, vector stores, entity memory, adaptive memory, dynamic memory or memory benchmarks (LoCoMo, LongMemEval).