Package: dlookr 0.6.6

dlookr: Tools for Data Diagnosis, Exploration, Transformation

A collection of tools that support data diagnosis, exploration, and transformation. Data diagnostics provides information and visualization of missing values, outliers, and unique and negative values to help you understand the distribution and quality of your data. Data exploration provides information and visualization of the descriptive statistics of univariate variables, normality tests and outliers, correlation of two variables, and the relationship between the target variable and predictor. Data transformation supports binning for categorizing continuous variables, imputes missing values and outliers, and resolves skewness. And it creates automated reports that support these three tasks.

Authors:Choonghyun Ryu [aut, cre]

dlookr_0.6.6.tar.gz
dlookr_0.6.6.zip(r-4.7)dlookr_0.6.6.zip(r-4.6)dlookr_0.6.6.zip(r-4.5)
dlookr_0.6.6.tgz(r-4.6-any)dlookr_0.6.6.tgz(r-4.5-any)
dlookr_0.6.6.tar.gz(r-4.7-any)dlookr_0.6.6.tar.gz(r-4.6-any)
dlookr_0.6.6.tgz(r-4.6-emscripten)
manual.pdf |manual.html
DESCRIPTION |NEWS
card.svg |card.png
dlookr/json (API)

# Install 'dlookr' in R:
install.packages('dlookr', repos = c('https://choonghyunryu.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/choonghyunryu/dlookr/issues

Pkgdown/docs site:https://choonghyunryu.github.io

Datasets:

On CRAN:

Conda:

10.94 score 215 stars 3 packages 758 scripts 1.9k downloads 2 mentions 63 exports 120 dependencies

Last updated from:759b7b01d9. Checks:9 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-x86_64OK279
source / vignettesOK287
linux-release-x86_64OK241
macos-release-arm64OK235
macos-oldrel-arm64OK228
windows-develOK204
windows-releaseOK194
windows-oldrelOK186
wasm-releaseOK166

Exports:binningbinning_bybinning_rgrcompare_categorycompare_numericcorrelatecramerdescribediagnosediagnose_categorydiagnose_numericdiagnose_outlierdiagnose_paged_reportdiagnose_reportdiagnose_sparesediagnose_web_reportdlookr_blue_pageddlookr_orange_pageddlookr_templ_htmleda_paged_reporteda_reporteda_web_reportentropyextractfind_classfind_nafind_outliersfind_skewnessget_classget_column_infoget_osget_percentileget_transformimport_google_fontimputate_naimputate_outlierjsdkldkurtosisnormalityoverviewperformance_binplot_bar_categoryplot_box_numericplot_correlateplot_hist_numericplot_na_hclustplot_na_intersectplot_na_paretoplot_normalityplot_outlierplot_qq_numericppsrelateskewnesstarget_bytheiltransformtransformation_paged_reporttransformation_reporttransformation_web_reportunivar_categoryunivar_numeric

Dependencies:AsioHeadersbackportsbase64encbitbit64bookdownbootbroombslibcachemclicliprcodetoolscommonmarkcpp11crayondigestdplyrevaluatefarverfastmapfontawesomeforcatsforeachfsgenericsggplot2glmnetgluegridExtragtablehavenhighrhmshtmltoolshtmlwidgetshttpuvisobanditeratorsjomojquerylibjsonlitekableExtraknitrlabelinglaterlatticelifecyclelme4magrittrMASSMatrixmemoisemicemimeminqamitmlnlmenloptrnnetnumDerivordinalotelpagedownpanpillarpkgconfigprettyunitsprocessxprogresspromisespspurrrR6rappdirsrbibutilsRColorBrewerRcppRcppEigenRdpackreactablereactRreadrreformulasrlangrmarkdownrpartrstudioapiS7sassscalesservrshapeshinyshowtextshowtextdbsourcetoolsstringistringrsurvivalsvglitesysfontssystemfontstextshapingtibbletidyrtidyselecttinytextzdbucminfutf8vctrsviridisLitevroomwebsocketwithrxfunxml2xtableyaml

Data quality diagnosis
Preface | Supported data structures | Data: nycflights13 | Data diagnosis | General diagnosis of all variables with diagnose() | Diagnosis of numeric variables with diagnose_numeric() | Diagnosis of categorical variables with diagnose_category() | Diagnosing outliers with diagnose_outlier() | Visualization of outliers using plot_outlier() | Visualization for missing values | visualize pareto chart using plot_na_pareto() | visualize combination chart using plot_na_hclust() | visualize combination chart using plot_na_intersect() | Automated report | Create a diagnostic report using diagnose_web_report() | Contents of dynamic web report | Some arguments for dynamic web report | Screenshot of dynamic report | Create a diagnostic report using diagnose_paged_report() | Contents of static paged report | Some arguments for static paged report | Screenshot of static report | Diagnosing tables in DBMS | Preparing table data | Diagnose data quality of variables in the DBMS | Diagnose data quality of categorical variables in the DBMS | Diagnose data quality of numerical variables in the DBMS | Diagnose outlier of numerical variables in the DBMS | Plot outlier information of numerical data diagnosis in the DBMS | Reporting the information of data diagnosis for table of thr DBMS

Last update: 2025-08-28
Started: 2018-04-23

Data Transformation
Preface | datasets | Imputation of missing values | imputes the missing value with imputate_na() | Collaboration with dplyr | Impute outliers | imputes the outliers with imputate_outlier() | Standardization and Resolving Skewness | Introduction to the use of transform() | Standardization with transform() | Resolving Skewness data with transform() | Binning | Binning of individual variables using binning() | Optimal Binning with binning_by() | Automated report | Create a dynamic report using transformation_web_report() | Contents of dynamic web report | Some arguments for dynamic web report | Screenshot of dynamic report | Create a static report using transformation_paged_report() | Contents of static paged report | Some arguments for static paged report | Screenshot of static report

Last update: 2025-08-28
Started: 2018-04-23

Exploratory Data Analysis
Preface | Supported data structures | datasets | Univariate data EDA | Calculating descriptive statistics using describe() | Test of normality on numeric variables using normality() | Visualization of normality of numerical variables using plot_normality() | EDA of bivariate data | Calculation of correlation coefficient using correlate() | Visualization of the correlation matrix using plot.correlate() | EDA based on target variable | Definition of target variable | EDA when target variable is categorical variable | Cases where predictors are numeric variable | Cases where predictors are categorical variable | EDA when target variable is numerical variable | Automated report | Create a dynamic report using eda_web_report() | Contents of dynamic web report | Some arguments for dynamic web report | Screenshot of dynamic report | Create a EDA report using eda_paged_report() | Contents of static paged report | Some arguments for static paged report | Screenshot of static report | Exploratory data analysis for tables in DBMS | Preparing table data | Calculating descriptive statistics of numerical column of table in the DBMS | Test of normality on numeric columns using in the DBMS | Normalization visualization of numerical column in the DBMS | Compute the correlation coefficient between two columns of the table in DBMS | Visualize correlation plot of numerical columns in the DBMS | Reporting the information of EDA for table of the DBMS

Last update: 2025-08-28
Started: 2018-04-23

Introduce dlookr
Preface | Supported data structures | List of supported tasks of data analytics | Diagnose Data | Overall Diagnose Data | Visualize Missing Values | Reporting | EDA | Univariate EDA | Bivariate EDA | Normality Test | Relationship between target variable and predictors | Transform Data | Find Variables | Imputation | Binning | Diagnose Binned Variable | Transformation | Miscellaneous | Statistics | Programming

Last update: 2023-12-29
Started: 2020-09-02

Readme and manuals

Help Manual

Help pageTopics
dlookr: Tools for Data Diagnosis, Exploration, Transformationdlookr-package dlookr
Binning the Numeric Databinning
Optimal Binning for Scoring Modelingbinning_by
Binning by recursive information gain ratio maximizationbinning_rgr
Sales of Child Car SeatsCarseats
Compare categorical variablescompare_category compare_category.data.frame
Compare numerical variablescompare_numeric compare_numeric.data.frame
Compute the correlation coefficient between two variablecorrelate correlate.data.frame correlate.grouped_df correlate.tbl_dbi
Cramer's V statisticcramer
Compute descriptive statisticdescribe describe.data.frame describe.grouped_df
Compute descriptive statisticdescribe.tbl_dbi
Diagnose data quality of variablesdiagnose diagnose.data.frame diagnose.grouped_df
Diagnose data quality of categorical variablesdiagnose_category diagnose_category.data.frame diagnose_category.grouped_df
Diagnose data quality of categorical variables in the DBMSdiagnose_category.tbl_dbi
Diagnose data quality of numerical variablesdiagnose_numeric diagnose_numeric.data.frame diagnose_numeric.grouped_df
Diagnose data quality of numerical variables in the DBMSdiagnose_numeric.tbl_dbi
Diagnose outlier of numerical variablesdiagnose_outlier diagnose_outlier.data.frame diagnose_outlier.grouped_df
Diagnose outlier of numerical variables in the DBMSdiagnose_outlier.tbl_dbi
Reporting the information of data diagnosisdiagnose_paged_report diagnose_paged_report.data.frame
Reporting the information of data diagnosis for table of the DBMSdiagnose_paged_report.tbl_dbi
Reporting the information of data diagnosisdiagnose_report diagnose_report.data.frame
Reporting the information of data diagnosis for table of the DBMSdiagnose_report.tbl_dbi
Diagnosis of level combinations of categorical variablesdiagnose_sparese diagnose_sparese.data.frame
Reporting the information of data diagnosis with htmldiagnose_web_report diagnose_web_report.data.frame
Reporting the information of data diagnosis for table of the DBMS with htmldiagnose_web_report.tbl_dbi
Diagnose data quality of variables in the DBMSdiagnose.tbl_dbi
Generate paged HTML documentdlookr_blue_paged dlookr_orange_paged
dlookr HTML templatedlookr_templ_html
Reporting the information of EDAeda_paged_report eda_paged_report.data.frame
Reporting the information of EDA for table of the DBMSeda_paged_report.tbl_dbi
Reporting the information of EDAeda_report eda_report.data.frame
Reporting the information of EDA for table of the DBMSeda_report.tbl_dbi
Reporting the information of EDA with htmleda_web_report eda_web_report.data.frame
Reporting the information of EDA for table of the DBMS with htmleda_web_report.tbl_dbi
Calculate the entropyentropy
Extract bins from "bins"extract extract.bins
Extract variable names or indices of a specific classfind_class
Finding variables including missing valuesfind_na
Finding variables including outliersfind_outliers
Finding skewed variablesfind_skewness
Flights dataflights
Extracting a class of variablesget_class
Describe column of table in the DBMSget_column_info
Finding Users Machine's OSget_os
Finding percentileget_percentile
Transform a numeric vectorget_transform
Heart Failure Dataheartfailure
Import Google Fontsimport_google_font
Impute Missing Valuesimputate_na
Impute Outliersimputate_outlier
Job Change of Data Scientistsjobchange
Jensen-Shannon Divergencejsd
Kullback-Leibler Divergencekld
Kurtosis of the datakurtosis
Performs the Shapiro-Wilk test of normalitynormality normality.data.frame normality.grouped_df
Performs the Shapiro-Wilk test of normalitynormality.tbl_dbi
Describe overview of dataoverview
Diagnose Performance Binned Variableperformance_bin
Plot bar chart of categorical variablesplot_bar_category plot_bar_category.data.frame plot_bar_category.grouped_df
Plot Box-Plot of numerical variablesplot_box_numeric plot_box_numeric.data.frame plot_box_numeric.grouped_df
Plot histogram of numerical variablesplot_hist_numeric plot_hist_numeric.data.frame plot_hist_numeric.grouped_df
Combination chart for missing valueplot_na_hclust
Plot the combination variables that is include missing valueplot_na_intersect
Pareto chart for missing valueplot_na_pareto
Plot distribution information of numerical dataplot_normality plot_normality.data.frame plot_normality.grouped_df
Plot distribution information of numerical dataplot_normality.tbl_dbi
Plot outlier information of numerical data diagnosisplot_outlier plot_outlier.data.frame
Plot outlier information of target_dfplot_outlier.target_df
Plot outlier information of numerical data diagnosis in the DBMSplot_outlier.tbl_dbi
Plot Q-Q plot of numerical variablesplot_qq_numeric plot_qq_numeric.data.frame plot_qq_numeric.grouped_df
Visualize Distribution for a "bins" objectplot.bins
Visualize Information for an "compare_category" Objectplot.compare_category
Visualize Information for an "compare_numeric" Objectplot.compare_numeric
Visualize Information for an "correlate" Objectplot.correlate
Visualize Information for an "imputation" Objectplot.imputation
Visualize Distribution for an "infogain_bins" Objectplot.infogain_bins
Visualize Distribution for an "optimal_bins" Objectplot.optimal_bins
Visualize Information for an "overview" Objectplot.overview
Visualize Performance for an "performance_bin" Objectplot.performance_bin
Visualize Information for an "pps" Objectplot.pps
Visualize Information for an "relate" Objectplot.relate
Visualize Information for an "transform" Objectplot.transform
Visualize Information for an "univar_category" Objectplot.univar_category
Visualize Information for an "univar_numeric" Objectplot.univar_numeric
Compute Predictive Power Scorepps pps.data.frame pps.target_df
Summarizing relate informationprint.relate
Relationship between target variable and variable of interestrelate relate.target_df
Skewness of the dataskewness
Summarizing Binned Variableprint.bins summary.bins
Summarizing compare_category informationprint.compare_category summary.compare_category
Summarizing compare_numeric informationprint.compare_numeric summary.compare_numeric
Summarizing Correlation Coefficientsummary.correlate
Summarizing imputation informationsummary.imputation
Summarizing Performance for Optimal Binssummary.optimal_bins
Summarizing overview informationsummary.overview
Summarizing Performance for Binned Variablesummary.performance_bin
Summarizing Predictive Power Scoresummary.pps
Summarizing transformation informationprint.transform summary.transform
Summarizing univar_category informationprint.univar_category summary.univar_category
Summarizing univar_numeric informationprint.univar_numeric summary.univar_numeric
Target by one variablestarget_by target_by.data.frame
Target by one column in the DBMStarget_by.tbl_dbi
Theil's U statistictheil
Data Transformationstransform
Reporting the information of transformationtransformation_paged_report
Reporting the information of transformationtransformation_report
Reporting the information of transformation with htmltransformation_web_report
Statistic of univariate categorical variablesunivar_category univar_category.data.frame
Statistic of univariate numerical variablesunivar_numeric univar_numeric.data.frame