Package: dlookr 0.6.3

dlookr: Tools for Data Diagnosis, Exploration, Transformation

A collection of tools that support data diagnosis, exploration, and transformation. Data diagnostics provides information and visualization of missing values, outliers, and unique and negative values to help you understand the distribution and quality of your data. Data exploration provides information and visualization of the descriptive statistics of univariate variables, normality tests and outliers, correlation of two variables, and the relationship between the target variable and predictor. Data transformation supports binning for categorizing continuous variables, imputes missing values and outliers, and resolves skewness. And it creates automated reports that support these three tasks.

Authors:Choonghyun Ryu [aut, cre]

dlookr_0.6.3.tar.gz
dlookr_0.6.3.zip(r-4.5)dlookr_0.6.3.zip(r-4.4)dlookr_0.6.3.zip(r-4.3)
dlookr_0.6.3.tgz(r-4.4-any)dlookr_0.6.3.tgz(r-4.3-any)
dlookr_0.6.3.tar.gz(r-4.5-noble)dlookr_0.6.3.tar.gz(r-4.4-noble)
dlookr_0.6.3.tgz(r-4.4-emscripten)dlookr_0.6.3.tgz(r-4.3-emscripten)
dlookr.pdf |dlookr.html
dlookr/json (API)
NEWS

# Install 'dlookr' in R:
install.packages('dlookr', repos = c('https://choonghyunryu.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/choonghyunryu/dlookr/issues

Datasets:

On CRAN:

11.96 score 208 stars 2 packages 720 scripts 3.4k downloads 2 mentions 63 exports 126 dependencies

Last updated 5 months agofrom:1e3797e85e. Checks:OK: 5 NOTE: 2. Indexed: yes.

TargetResultDate
Doc / VignettesOKNov 06 2024
R-4.5-winNOTENov 06 2024
R-4.5-linuxNOTENov 06 2024
R-4.4-winOKNov 06 2024
R-4.4-macOKNov 06 2024
R-4.3-winOKNov 06 2024
R-4.3-macOKNov 06 2024

Exports:binningbinning_bybinning_rgrcompare_categorycompare_numericcorrelatecramerdescribediagnosediagnose_categorydiagnose_numericdiagnose_outlierdiagnose_paged_reportdiagnose_reportdiagnose_sparesediagnose_web_reportdlookr_blue_pageddlookr_orange_pageddlookr_templ_htmleda_paged_reporteda_reporteda_web_reportentropyextractfind_classfind_nafind_outliersfind_skewnessget_classget_column_infoget_osget_percentileget_transformimport_google_fontimputate_naimputate_outlierjsdkldkurtosisnormalityoverviewperformance_binplot_bar_categoryplot_box_numericplot_correlateplot_hist_numericplot_na_hclustplot_na_intersectplot_na_paretoplot_normalityplot_outlierplot_qq_numericppsrelateskewnesstarget_bytheiltransformtransformation_paged_reporttransformation_reporttransformation_web_reportunivar_categoryunivar_numeric

Dependencies:AsioHeadersbackportsbase64encbitbit64bookdownbootbroombslibcachemclicliprcodetoolscolorspacecommonmarkcpp11crayondigestdplyrevaluateextrafontextrafontdbfansifarverfastmapfontawesomefontBitstreamVerafontLiberationfontquiverforcatsforeachfsgdtoolsgenericsggplot2glmnetgluegridExtragtablehavenhighrhmshrbrthemeshtmltoolshtmlwidgetshttpuvisobanditeratorsjomojquerylibjsonlitekableExtraknitrlabelinglaterlatticelifecyclelme4magrittrMASSMatrixmemoisemgcvmicemimeminqamitmlmunsellnlmenloptrnnetnumDerivordinalpagedownpanpillarpkgconfigprettyunitsprocessxprogresspromisespspurrrR6rappdirsRColorBrewerRcppRcppEigenreactablereactRreadrrlangrmarkdownrpartrstudioapiRttf2pt1sassscalesservrshapeshinyshowtextshowtextdbsourcetoolsstringistringrsurvivalsvglitesysfontssystemfontstibbletidyrtidyselecttinytextzdbucminfutf8vctrsviridisLitevroomwebsocketwithrxfunxml2xtableyaml

Data quality diagnosis

Rendered fromdiagonosis.Rmdusingknitr::rmarkdownon Nov 06 2024.

Last update: 2024-06-09
Started: 2018-04-23

Data Transformation

Rendered fromtransformation.Rmdusingknitr::rmarkdownon Nov 06 2024.

Last update: 2024-06-09
Started: 2018-04-23

Exploratory Data Analysis

Rendered fromEDA.Rmdusingknitr::rmarkdownon Nov 06 2024.

Last update: 2024-06-09
Started: 2018-04-23

Introduce dlookr

Rendered fromIntroduce.Rmdusingknitr::rmarkdownon Nov 06 2024.

Last update: 2023-12-29
Started: 2020-09-02

Readme and manuals

Help Manual

Help pageTopics
dlookr: Tools for Data Diagnosis, Exploration, Transformationdlookr-package dlookr
Binning the Numeric Databinning
Optimal Binning for Scoring Modelingbinning_by
Binning by recursive information gain ratio maximizationbinning_rgr
Sales of Child Car SeatsCarseats
Compare categorical variablescompare_category compare_category.data.frame
Compare numerical variablescompare_numeric compare_numeric.data.frame
Compute the correlation coefficient between two variablecorrelate correlate.data.frame correlate.grouped_df correlate.tbl_dbi
Cramer's V statisticcramer
Compute descriptive statisticdescribe describe.data.frame describe.grouped_df
Compute descriptive statisticdescribe.tbl_dbi
Diagnose data quality of variablesdiagnose diagnose.data.frame diagnose.grouped_df
Diagnose data quality of categorical variablesdiagnose_category diagnose_category.data.frame diagnose_category.grouped_df
Diagnose data quality of categorical variables in the DBMSdiagnose_category.tbl_dbi
Diagnose data quality of numerical variablesdiagnose_numeric diagnose_numeric.data.frame diagnose_numeric.grouped_df
Diagnose data quality of numerical variables in the DBMSdiagnose_numeric.tbl_dbi
Diagnose outlier of numerical variablesdiagnose_outlier diagnose_outlier.data.frame diagnose_outlier.grouped_df
Diagnose outlier of numerical variables in the DBMSdiagnose_outlier.tbl_dbi
Reporting the information of data diagnosisdiagnose_paged_report diagnose_paged_report.data.frame
Reporting the information of data diagnosis for table of the DBMSdiagnose_paged_report.tbl_dbi
Reporting the information of data diagnosisdiagnose_report diagnose_report.data.frame
Reporting the information of data diagnosis for table of the DBMSdiagnose_report.tbl_dbi
Diagnosis of level combinations of categorical variablesdiagnose_sparese diagnose_sparese.data.frame
Reporting the information of data diagnosis with htmldiagnose_web_report diagnose_web_report.data.frame
Reporting the information of data diagnosis for table of the DBMS with htmldiagnose_web_report.tbl_dbi
Diagnose data quality of variables in the DBMSdiagnose.tbl_dbi
Generate paged HTML documentdlookr_blue_paged dlookr_orange_paged
dlookr HTML templatedlookr_templ_html
Reporting the information of EDAeda_paged_report eda_paged_report.data.frame
Reporting the information of EDA for table of the DBMSeda_paged_report.tbl_dbi
Reporting the information of EDAeda_report eda_report.data.frame
Reporting the information of EDA for table of the DBMSeda_report.tbl_dbi
Reporting the information of EDA with htmleda_web_report eda_web_report.data.frame
Reporting the information of EDA for table of the DBMS with htmleda_web_report.tbl_dbi
Calculate the entropyentropy
Extract bins from "bins"extract extract.bins
Extract variable names or indices of a specific classfind_class
Finding variables including missing valuesfind_na
Finding variables including outliersfind_outliers
Finding skewed variablesfind_skewness
Flights dataflights
Extracting a class of variablesget_class
Describe column of table in the DBMSget_column_info
Finding Users Machine's OSget_os
Finding percentileget_percentile
Transform a numeric vectorget_transform
Heart Failure Dataheartfailure
Import Google Fontsimport_google_font
Impute Missing Valuesimputate_na
Impute Outliersimputate_outlier
Job Change of Data Scientistsjobchange
Jensen-Shannon Divergencejsd
Kullback-Leibler Divergencekld
Kurtosis of the datakurtosis
Performs the Shapiro-Wilk test of normalitynormality normality.data.frame normality.grouped_df
Performs the Shapiro-Wilk test of normalitynormality.tbl_dbi
Describe overview of dataoverview
Diagnose Performance Binned Variableperformance_bin
Plot bar chart of categorical variablesplot_bar_category plot_bar_category.data.frame plot_bar_category.grouped_df
Plot Box-Plot of numerical variablesplot_box_numeric plot_box_numeric.data.frame plot_box_numeric.grouped_df
Plot histogram of numerical variablesplot_hist_numeric plot_hist_numeric.data.frame plot_hist_numeric.grouped_df
Combination chart for missing valueplot_na_hclust
Plot the combination variables that is include missing valueplot_na_intersect
Pareto chart for missing valueplot_na_pareto
Plot distribution information of numerical dataplot_normality plot_normality.data.frame plot_normality.grouped_df
Plot distribution information of numerical dataplot_normality.tbl_dbi
Plot outlier information of numerical data diagnosisplot_outlier plot_outlier.data.frame
Plot outlier information of target_dfplot_outlier.target_df
Plot outlier information of numerical data diagnosis in the DBMSplot_outlier.tbl_dbi
Plot Q-Q plot of numerical variablesplot_qq_numeric plot_qq_numeric.data.frame plot_qq_numeric.grouped_df
Visualize Distribution for a "bins" objectplot.bins
Visualize Information for an "compare_category" Objectplot.compare_category
Visualize Information for an "compare_numeric" Objectplot.compare_numeric
Visualize Information for an "correlate" Objectplot.correlate
Visualize Information for an "imputation" Objectplot.imputation
Visualize Distribution for an "infogain_bins" Objectplot.infogain_bins
Visualize Distribution for an "optimal_bins" Objectplot.optimal_bins
Visualize Information for an "overview" Objectplot.overview
Visualize Performance for an "performance_bin" Objectplot.performance_bin
Visualize Information for an "pps" Objectplot.pps
Visualize Information for an "relate" Objectplot.relate
Visualize Information for an "transform" Objectplot.transform
Visualize Information for an "univar_category" Objectplot.univar_category
Visualize Information for an "univar_numeric" Objectplot.univar_numeric
Compute Predictive Power Scorepps pps.data.frame pps.target_df
Summarizing relate informationprint.relate
Relationship between target variable and variable of interestrelate relate.target_df
Skewness of the dataskewness
Summarizing Binned Variableprint.bins summary.bins
Summarizing compare_category informationprint.compare_category summary.compare_category
Summarizing compare_numeric informationprint.compare_numeric summary.compare_numeric
Summarizing Correlation Coefficientsummary.correlate
Summarizing imputation informationsummary.imputation
Summarizing Performance for Optimal Binssummary.optimal_bins
Summarizing overview informationsummary.overview
Summarizing Performance for Binned Variablesummary.performance_bin
Summarizing Predictive Power Scoresummary.pps
Summarizing transformation informationprint.transform summary.transform
Summarizing univar_category informationprint.univar_category summary.univar_category
Summarizing univar_numeric informationprint.univar_numeric summary.univar_numeric
Target by one variablestarget_by target_by.data.frame
Target by one column in the DBMStarget_by.tbl_dbi
Theil's U statistictheil
Data Transformationstransform
Reporting the information of transformationtransformation_paged_report
Reporting the information of transformationtransformation_report
Reporting the information of transformation with htmltransformation_web_report
Statistic of univariate categorical variablesunivar_category univar_category.data.frame
Statistic of univariate numerical variablesunivar_numeric univar_numeric.data.frame