Version 1.1.0 Stable Local-First EDA

Instant Dataset Profiling.
Zero Python Scripts Required.

DataPrism is a local-first Exploratory Data Analysis (EDA) tool. Drop in any CSV or JSON file and immediately audit data quality, outliers, missing fields, and correlation matrices right in your browser or inside VS Code.

Experience DataPrism Live

Drop a CSV or JSON file in the mockup below to run our full analysis engine entirely client-side. No data is ever uploaded to any server.

employees.csv ×
Source: employees.csv -
DataPrism Profiler Dashboard
Total Rows
-
Total Columns
-
Missing Cells
-
Duplicate Rows
-
Descriptive Column Profiles
Column Type Nulls (%) Uniques Mean / Top Min / Freq Max / Pct Std Dev

Inferred Dataset Overview

Analyzing dataset structure...

Feature Engineering Suggestions

Missing Value Imputation Recipes

Correlation Analysis

This heatmap displays the Pearson correlation coefficients between numeric attributes. Hover over blocks to inspect values.

-1.0
+1.0

Composite Data Health Score

100
Cleanliness Grade

Evaluating data properties...

Deduction Audit Log

Detected Outliers (IQR Method)

Column Outliers % Extreme Values Sample
Python Profiler Execution Logs
Welcome to DataPrism. Loading dataset workspace...
Engine Features & Capabilities

DataPrism incorporates a high-fidelity diagnostic engine running fully client-side to summarize, clean, and enrich datasets.

📊

Descriptive Math Engine

Computes Fisher-Pearson skewness, standard deviation, and quartile interpolations client-side with zero latency.

🛡️

Outlier Auditing

Applies Tukey's fences (IQR bounds) to identify anomalies, list indexes, and score overall dataset cleanliness.

💡

Feature Ideas & Recipes

Flags target leakage, recommends scaling/log transforms, and writes copyable Python pandas recipes for data preparation.

🔗

Correlation Matrix

Builds full Spearman and Pearson correlation grids and renders interactive heatmap visualizations natively inside your IDE.

Descriptive Math Engine

DataPrism conducts all numerical computations locally. Key algorithms include:

Standard Deviation ($\sigma$)

$$\sigma = \sqrt{\frac{1}{N} \sum_{i=1}^{N} (x_i - \mu)^2}$$

Quantifies the variance of distribution values around the arithmetic mean $\mu$.

Fisher-Pearson Skewness ($g_1$)

$$g_1 = \frac{\frac{1}{N} \sum_{i=1}^{N} (x_i - \mu)^3}{\sigma^3}$$

Determines asymmetry. If skewness is high, DataPrism recommends log transforms.

Outlier IQR Thresholds

$$[Q_{25} - 1.5 \times \text{IQR},\, Q_{75} + 1.5 \times \text{IQR}]$$

Values outside these boundaries are identified as anomalies and recorded in the audit logs.

Install Extension inside VS Code

DataPrism runs completely offline as a standard extension. Add it to your editor workspace now.

VS Code Marketplace
ext install CODExGAMERZ.dataprism