Pandas DataFrame Schema Validator
Validates Pandas DataFrame structures using the pandera library API and pa.DataFrameSchema definitions. Enforces column types, nullable constraints, and custom check functions via pandera.Check.
Validates Pandas DataFrame structures using the pandera library API and pa.DataFrameSchema definitions. Enforces column types, nullable constraints, and custom check functions via pandera.Check.
npx skills add agentskillexchange/skills --skill pandas-dataframe-schema-validator
The Pandas DataFrame Schema Validator enforces data quality rules on Pandas DataFrames using the pandera schema validation library. It defines validation schemas using pa.DataFrameSchema with column-level type enforcement, nullable constraints, and value range specifications via pandera.Column definitions. The validator supports custom check functions using pandera.Check with lambda expressions, regex patterns, and statistical validations for distribution testing. It integrates with the pandas.api.types module to validate dtype compatibility across DataFrame operations. Schema inference is available using pandera.infer_schema() to generate baseline schemas from sample data, which can then be customized and tightened. The skill validates multi-index DataFrames, handles categorical dtype enforcement, and supports hypothesis testing checks using pandera.Hypothesis for statistical property validation. Error reporting generates detailed failure summaries with row indices, failing values, and schema violation categories.