Marketplace

Find the right skill for the job.

Browse the full catalog through outcome-first channels, technical facets, rating filters, and server-side pagination built for a large public marketplace.

Start with the job to be done
3655 results
Analyze Data & AI · Data / AI
Page 47 of 153
SkillHub ClubAnalyze Data & AI

data-exploration

Profile and explore datasets to understand their shape, quality, and patterns before analysis. Use when encountering a new dataset, assessing data quality, discovering column distributions, identifying nulls and outliers, or deciding which dimensions to analyze.

C 4.5
Full StackData / AI
10K
rank 88
hot 99
SkillHub ClubAnalyze Data & AI

instrument-data-to-allotrope

Convert laboratory instrument output files (PDF, CSV, Excel, TXT) to Allotrope Simple Model (ASM) JSON format or flattened 2D CSV. Use this skill when scientists need to standardize instrument data for LIMS systems, data lakes, or downstream analysis. Supports auto-detection of instrument types. Outputs include full ASM JSON, flattened CSV for easy import, and exportable Python code for data engineers. Common triggers include converting instrument files, standardizing lab data, preparing data for upload to LIMS/ELN systems, or generating parser code for production pipelines.

C 4.5
Full StackData / AI
10K
rank 88
hot 99
SkillHub ClubAnalyze Data & AI

nextflow-development

Run nf-core bioinformatics pipelines (rnaseq, sarek, atacseq) on sequencing data. Use when analyzing RNA-seq, WGS/WES, or ATAC-seq data—either local FASTQs or public datasets from GEO/SRA. Triggers on nf-core, Nextflow, FASTQ analysis, variant calling, gene expression, differential expression, GEO reanalysis, GSE/GSM/SRR accessions, or samplesheet creation.

C 4.5
Full StackData / AI
10K
rank 88
hot 99
SkillHub ClubAnalyze Data & AI

single-cell-rna-qc

Performs quality control on single-cell RNA-seq data (.h5ad or .h5 files) using scverse best practices with MAD-based filtering and comprehensive visualizations. Use when users request QC analysis, filtering low-quality cells, assessing data quality, or following scverse/scanpy best practices for single-cell analysis.

C 4.5
Full StackData / AI
10K
rank 88
hot 99
SkillHub ClubAnalyze Data & AI

scvi-tools

Deep learning for single-cell analysis using scvi-tools. This skill should be used when users need (1) data integration and batch correction with scVI/scANVI, (2) ATAC-seq analysis with PeakVI, (3) CITE-seq multi-modal analysis with totalVI, (4) multiome RNA+ATAC analysis with MultiVI, (5) spatial transcriptomics deconvolution with DestVI, (6) label transfer and reference mapping with scANVI/scArches, (7) RNA velocity with veloVI, or (8) any deep learning-based single-cell method. Triggers include mentions of scVI, scANVI, totalVI, PeakVI, MultiVI, DestVI, veloVI, sysVI, scArches, variational autoencoder, VAE, batch correction, data integration, multi-modal, CITE-seq, multiome, reference mapping, latent space.

C 4.5
Full StackData / AIIntegration
9.8K
rank 88
hot 99
SkillHub ClubAnalyze Data & AI

testing-agent

Run goal-based evaluation tests for agents. Use when you need to verify an agent meets its goals, debug failing tests, or iterate on agent improvements based on test results.

C 4.5
Full StackData / AITesting
9.6K
rank 88
hot 99
SkillHub ClubAnalyze Data & AI

hf-mcp

Use Hugging Face Hub via MCP server tools. Search models, datasets, Spaces, papers. Get repo details, fetch documentation, run compute jobs, and use Gradio Spaces as AI tools. Available when connected to the HF MCP server.

C 4.5
Full StackBackendData / AI
9.3K
rank 88
hot 99
SkillHub ClubAnalyze Data & AI

dev-browser

Browser automation with persistent page state. Use when users ask to navigate websites, fill forms, take screenshots, extract web data, test web apps, or automate browser workflows. Trigger phrases include "go to [url]", "click on", "fill out the form", "take a screenshot", "scrape", "automate", "test the website", "log into", or any browser interaction request.

C 4.5
Full StackData / AITesting
7.4K
rank 86
hot 99
SkillHub ClubAnalyze Data & AI

pinecone

Managed vector database for production AI applications. Fully managed, auto-scaling, with hybrid search (dense + sparse), metadata filtering, and namespaces. Low latency (<100ms p95). Use for production RAG, recommendation systems, or semantic search at scale. Best for serverless, managed infrastructure.

C 4.5
Full StackBackendData / AI
RAGPineconeVector DatabaseManaged Service
5.2K
rank 85
hot 99
SkillHub ClubAnalyze Data & AI

pytorch-fsdp2

Adds PyTorch FSDP2 (fully_shard) to training scripts with correct init, sharding, mixed precision/offload config, and distributed checkpointing. Use when models exceed single-GPU memory or when you need DTensor-based sharding with DeviceMesh.

C 4.5
Full StackData / AI
PyTorchFSDP2Fully Sharded Data ParallelDistributed Training
5.2K
rank 85
hot 99
SkillHub ClubAnalyze Data & AI

model-merging

Merge multiple fine-tuned models using mergekit to combine capabilities without retraining. Use when creating specialized models by blending domain-specific expertise (math + coding + chat), improving performance beyond single models, or experimenting rapidly with model variants. Covers SLERP, TIES-Merging, DARE, Task Arithmetic, linear merging, and production deployment strategies.

C 4.5
Full StackDevOpsData / AI
Emerging TechniquesModel MergingMergekitSLERP
5.2K
rank 85
hot 99
SkillHub ClubAnalyze Data & AI

rwkv-architecture

RNN+Transformer hybrid with O(n) inference. Linear time, infinite context, no KV cache. Train like GPT (parallel), infer like RNN (sequential). Linux Foundation AI project. Production at Windows, Office, NeMo. RWKV-7 (March 2025). Models up to 14B parameters.

C 4.5
Full StackData / AI
RWKVModel ArchitectureRNNTransformer Hybrid
5.2K
rank 85
hot 99
SkillHub ClubAnalyze Data & AI

modal-serverless-gpu

Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure management, deploying ML models as APIs, or running batch jobs with automatic scaling.

C 4.5
Full StackDevOpsData / AI
InfrastructureServerlessGPUCloud
5.2K
rank 85
hot 99
SkillHub ClubAnalyze Data & AI

llamaindex

Data framework for building LLM applications with RAG. Specializes in document ingestion (300+ connectors), indexing, and querying. Features vector indices, query engines, agents, and multi-modal support. Use for document Q&A, chatbots, knowledge retrieval, or building RAG pipelines. Best for data-centric LLM applications.

C 4.5
Full StackData / AI
AgentsLlamaIndexRAGDocument Ingestion
5.2K
rank 85
hot 99
SkillHub ClubAnalyze Data & AI

slime-rl-training

Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models, implementing custom data generation workflows, or needing tight Megatron-LM integration for RL scaling.

C 4.5
Full StackData / AIIntegration
Reinforcement LearningMegatron-LMSGLangGRPO
5.2K
rank 85
hot 99
SkillHub ClubAnalyze Data & AI

chroma

Open-source embedding database for AI applications. Store embeddings and metadata, perform vector and full-text search, filter by metadata. Simple 4-function API. Scales from notebooks to production clusters. Use for semantic search, RAG applications, or document retrieval. Best for local development and open-source projects.

C 4.5
Full StackBackendData / AI
RAGChromaVector DatabaseEmbeddings
5.2K
rank 85
hot 99
SkillHub ClubAnalyze Data & AI

hunt-analytics-generation

Generate query-agnostic analytics that model adversary behavior by translating hunt investigative intent into analytic definitions grounded in schema semantics. This skill is used to define how behavior should manifest in data before query execution or validation, and works best when informed by system internals, adversary tradecraft, a structured hunt focus, and suggested data sources.

C 4.5
Full StackData / AI
4.5K
rank 84
hot 99
SkillHub ClubAnalyze Data & AI

platform

Imported from https://github.com/archestra-ai/archestra.

C 4.5
Full StackData / AI
3.5K
rank 83
hot 99
SkillHub ClubAnalyze Data & AI

agentic-context-engine

Imported from https://www.skillhub.club/skills/kayba-ai-agentic-context-engine.

C 4.5
Full StackData / AI
2K
rank 80
hot 99
SkillHub ClubAnalyze Data & AI

rdc-react-testing

Test @data-client/react with renderDataHook - jest unit tests, fixtures, interceptors, mock responses, nock HTTP mocking, hook testing, component testing

C 4.5
Full StackFrontendData / AI
2K
rank 80
hot 99
SkillHub ClubAnalyze Data & AI

subagent-creator

Guide for creating AI subagents with isolated context for complex multi-step workflows. Use when users want to create a subagent, specialized agent, verifier, debugger, or orchestrator that requires isolated context and deep specialization. Works with any agent that supports subagent delegation. Triggers on "create subagent", "new agent", "specialized assistant", "create verifier".

C 4.5
Full StackData / AI
1.8K
rank 79
hot 99
SkillHub ClubAnalyze Data & AI

apify-actor-development

Develop, debug, and deploy Apify Actors - serverless cloud programs for web scraping, automation, and data processing. Use when creating new Actors, modifying existing ones, or troubleshooting Actor code.

C 4.5
Full StackDevOpsData / AI
1.7K
rank 79
hot 99
SkillHub ClubAnalyze Data & AI

create-mcp-servers

Create Model Context Protocol (MCP) servers that expose tools, resources, and prompts to Claude. Use when building custom integrations, APIs, data sources, or any server that Claude should interact with via the MCP protocol. Supports both TypeScript and Python implementations.

C 4.5
Full StackBackendData / AI
1.6K
rank 78
hot 99
SkillHub ClubAnalyze Data & AI

skill-creator

Creates new AI agent skills following the Agent Skills spec. Trigger: When user asks to create a new skill, add agent instructions, or document patterns for AI.

C 4.5
Full StackData / AI
1.5K
rank 78
hot 99
Previous
Page 47 of 153
Next