⚠️Install with caution. This skill has very few installs. Always review the source and verify it on clawhub.ai before installing. Community-built skills run with agent permissions — only install ones you trust.

🛠️ Utilities & Tools

Dataset Finderv0.1.0

Name: Dataset Finder
Author: anisafifi

dataset-finder

anisafifi

Use this skill when users need to search for datasets, download data files, or explore data repositories. Triggers include: requests to "find datasets", "search for data", "download dataset from Kaggle", "get data from Hugging Face", "find ML datasets", or mentions of data repositories like Kaggle, UCI ML Repository, Data.gov, or Hugging Face. Also use for previewing dataset statistics, generating data cards, or discovering datasets for machine learning projects. Requires OpenClawCLI installation from clawhub.ai.

Download Package View on ClawHub

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install anisafifi/dataset-finder

Skill Package4 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

A68/100

Grade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation12/35

· 5 installs (low)
· 902 downloads (moderate demand)

Documentation20/25

· SKILL.md present
· Detailed documentation (≥3000 chars)
· Contains usage examples or trigger description
· Detailed summary

Package Completeness11/15

· skillAssets present (3 files)

Security Analysis

💙 Low Risk

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://clawhub.ai/

Audited Apr 16, 2026 · audit v1.0

💡

Usage Guide

Generated Mar 1, 2026

Data ScientistsAcademic ResearchersAI Engineersbeginner

💡 Application Scenarios

Academic Research Data DiscoveryEducation/Research

Researchers in universities or labs need to find benchmark datasets for machine learning experiments, such as classification or regression tasks from the UCI ML Repository. This skill helps them quickly search, preview statistics, and download datasets in various formats without manual browsing, accelerating literature review and experimental setup.

Data Science Project SourcingTechnology/Consulting

Data scientists and analysts working on commercial projects, like predictive modeling for housing prices, use this skill to search Kaggle and Hugging Face for relevant datasets. It enables filtering by file type or license, downloading data directly, and generating data cards for documentation, streamlining the initial data acquisition phase.

Government Open Data AnalysisGovernment/Non-Profit

Policy analysts or civic tech developers need to access and explore public datasets from Data.gov for projects like economic trend analysis or environmental monitoring. This skill allows searching across repositories, previewing dataset shapes and missing values, and managing downloads locally, facilitating transparent data-driven insights.

NLP Model TrainingArtificial Intelligence

AI engineers building natural language processing models, such as sentiment analysis tools, use this skill to find and download text datasets from Hugging Face. It supports filtering by task and language, streaming large datasets, and generating usage examples, reducing time spent on data preparation for model training.

Educational Curriculum DevelopmentEdTech

Instructors or online course creators designing machine learning tutorials need curated datasets for hands-on exercises. This skill helps search for datasets like IMDB reviews, preview basic statistics, and list local files, ensuring students have accessible, well-documented data for learning projects in classrooms or self-paced courses.

💼 Business Models

Freemium SaaS for Data TeamsSubscription fees

Offer a cloud-based version with basic search and preview features for free, while charging for advanced analytics, team collaboration tools, and API access to premium datasets. Revenue comes from subscription tiers based on usage volume and enterprise support, targeting mid-sized tech companies.

Enterprise Data IntegrationLicensing and maintenance fees

License the skill as part of a larger data platform for corporations, integrating with internal data lakes and workflow systems. Revenue is generated through one-time licensing fees and annual maintenance contracts, with customization for specific industries like finance or healthcare.

Consulting and Training ServicesService fees

Provide paid workshops and consulting sessions to help organizations implement the skill for data discovery and management. Revenue comes from hourly rates or project-based fees, focusing on upskilling teams in data science and optimizing dataset usage for machine learning projects.

💬 Integration Tip

Ensure OpenClawCLI is installed first, and set up API credentials for Kaggle and Hugging Face to enable full functionality across repositories.