Skill Detail

Parquet to PostgreSQL Loader

Reads Apache Parquet files using PyArrow and bulk-loads them into PostgreSQL via psycopg2 COPY protocol. Handles schema mapping, partitioned datasets, and incremental upserts with conflict resolution.

Data Extraction & TransformationClaude Agents

Data Extraction & Transformation Claude Agents Published

Tool match: parquet

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill parquet-to-postgresql-loader Copy

Works best when you want a reusable capability, not another fragile one-off prompt.

View source Documentation

At a glance

Author

Apache Software Foundation

Last updated

Mar 24, 2026

Quick brief

The Parquet to PostgreSQL Loader skill enables high-performance ingestion of Apache Parquet files into PostgreSQL databases. Built on PyArrow for Parquet reading and psycopg2 for database interaction, it leverages the COPY protocol for maximum throughput rather than row-by-row INSERT statements. The skill automatically maps Parquet schema types to PostgreSQL column types, handling complex conversions like nested structs to JSONB, timestamps with timezone awareness, and decimal precision preservation. For partitioned Parquet datasets (common in data lake architectures), it discovers and processes all partition files while preserving partition column values. Incremental loading is supported through configurable upsert strategies using PostgreSQL ON CONFLICT clauses — choose between UPDATE, IGNORE, or custom merge logic. The loader creates target tables automatically when they do not exist, with proper indexes on partition and primary key columns. Progress reporting, checkpointing for resumable loads, and dry-run mode for schema verification round out the feature set. Compatible with Parquet files from Spark, Athena, BigQuery exports, and DuckDB.

Best fit

When to reach for it

Best when the job fits Data Extraction & Transformation.
Works naturally with Claude Agents setups.

Trust & provenance

Why this listing is credible

Built around the parquet toolchain.
Trust status: Published.
Last updated Mar 24, 2026.

View source ↗ Documentation ↗