{"id":1240,"date":"2018-07-30T21:14:51","date_gmt":"2018-07-30T21:14:51","guid":{"rendered":"https:\/\/www.codeastar.com\/?p=1240"},"modified":"2018-10-25T09:59:59","modified_gmt":"2018-10-25T09:59:59","slug":"blending-data-science-competition","status":"publish","type":"post","link":"https:\/\/www.codeastar.com\/blending-data-science-competition\/","title":{"rendered":"Blending, the Dark Side in Data Science Competition"},"content":{"rendered":"

In our past machine learning topic, “Ensemble Modeling<\/a>“, we mentioned how blending helps improve our predictions. Then in another topic, “Why are people frustrated on Kaggle\u2019s challenge?<\/a>“, we mentioned how blending can ruin a data science competition. So here is the question: is blending good or bad?<\/p>\n

<\/p>\n

Blending and Frustration<\/h3>\n

Technically, blending is good, and we proved it by improving our Iowa House Price<\/a> prediction. The technique itself is not the issue; the way people use it is. In the last TalkingData Click Fraud Detection<\/a> challenge, people spent days and nights on feature engineering and model research. They posted and shared their results as public kernels. Then some people, whom we call “blenders”, gathered other people’s hard-won results, applied the blending technique in 5 minutes, and got a better score. That earned them a higher ranking on the leaderboard too. If you were one of those hard-working developers and got out-ranked by blenders, you might well be frustrated.<\/p>\n

Get a better result by taking advantage of others<\/del><\/h3>\n

We should not abuse other people’s hard work, but we should understand how blenders do it. So we start our experiment in the House Price prediction challenge. First of all, we collect output files from the 7 best-scoring RMSD<\/a> public kernels. So we have:<\/p>\n

    \n
  1. stacking, MICE and brutal force<\/a>\u00a0 – 0.10985<\/li>\n
  2. Lasso model for regression problem<\/a> – 0.11365<\/li>\n
  3. House Price Prediction From Bangladesh<\/a> – 0.11416<\/li>\n
  4. All You Need is PCA<\/a> – 0.11421<\/li>\n
  5. Amit Choudhary’s Kernel Notebook-ified<\/a> – 0.11439<\/li>\n
  6. just NN use gluon<\/a> – 0.1148<\/li>\n
  7. My submission to predict sale price<\/a>\u00a0– 0.11533\n
    <\/div>\n
    Please note that besides selecting kernels by score, we tend to select kernels that use different models.<\/div>\n<\/li>\n<\/ol>\n

    Then we can download the output files from the above kernels, open our own kernel (or\u00a0Jupyter Notebook) and import those output files as our input (much like what we did in the CNN image recognizer<\/a> project: outputs from the previous layer become inputs of the next layer).<\/p>\n

    import pandas as pd\r\n\r\ndf_base_0 = pd.read_csv('..\/input\/stacking-mice-and-brutal-force-10985\/House_Prices_submit.csv',names=[\"Id\",\"SalePrice_0\"], skiprows=[0],header=None)\r\ndf_base_1 = pd.read_csv('..\/input\/lasso-11365\/lasso_sol22_Median.csv',names=[\"Id\",\"SalePrice_1\"], skiprows=[0],header=None)\r\ndf_base_2 = pd.read_csv('..\/input\/bangladesh-stack-11416\/submission (1).csv',names=[\"Id\",\"SalePrice_2\"], skiprows=[0],header=None)\r\ndf_base_3 = pd.read_csv('..\/input\/pca-11421\/submission (2).csv',names=[\"Id\",\"SalePrice_3\"], skiprows=[0],header=None)\r\ndf_base_4 = pd.read_csv('..\/input\/xgb-lasso-11439\/output.csv',names=[\"Id\",\"SalePrice_4\"], skiprows=[0],header=None)\r\ndf_base_5 = pd.read_csv('..\/input\/nn-1148\/submission (3).csv',names=[\"Id\",\"SalePrice_5\"], skiprows=[0],header=None)\r\ndf_base_6 = pd.read_csv('..\/input\/stack-xgb-lgb-11533\/submission_stacked.csv',names=[\"Id\",\"SalePrice_6\"], skiprows=[0],header=None)\r\n<\/pre>\n

    We now have 7 dataframes, each containing “Id” and “SalePrice” fields. Let's pick 2 of them, “df_base_0” and “df_base_5”, as examples:<\/p>\n

    \"dataframe<\/p>\n

    All of our dataframes share the same Id values but carry different SalePrice predictions, so we can merge them into a single dataframe, “df_base”, using Id as the key.<\/p>\n

    df_base = pd.merge(df_base_0,df_base_1,how='inner',on='Id')\r\ndf_base = pd.merge(df_base,df_base_2,how='inner',on='Id')\r\ndf_base = pd.merge(df_base,df_base_3,how='inner',on='Id')\r\ndf_base = pd.merge(df_base,df_base_4,how='inner',on='Id')\r\ndf_base = pd.merge(df_base,df_base_5,how='inner',on='Id')\r\ndf_base = pd.merge(df_base,df_base_6,how='inner',on='Id')\r\n<\/pre>\n

    Here it comes:<\/p>\n

    \"df<\/p>\n

    Instead of simply blending all the SalePrice columns by taking their mean, we can go one step further to get a better result.<\/p>\n
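For reference, that straightforward mean blend looks like this. A minimal sketch with made-up prices; `df_base` stands in for the merged dataframe built above:

```python
import pandas as pd

# Toy merged dataframe: an Id column plus one SalePrice column per kernel
df_base = pd.DataFrame({
    "Id": [1461, 1462],
    "SalePrice_0": [120000.0, 150000.0],
    "SalePrice_1": [122000.0, 148000.0],
    "SalePrice_2": [118000.0, 152000.0],
})

# A plain mean blend: average every SalePrice_* column row by row
df_mean = pd.DataFrame({
    "Id": df_base["Id"],
    "SalePrice": df_base.iloc[:, 1:].mean(axis=1),
})
# df_mean["SalePrice"] -> [120000.0, 150000.0]
```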

    Blend by Correlation<\/h3>\n

    In order to get a better result, we should blend outputs from different sources; that is why we intentionally picked output files from different models. We can also visualize how those output files differ from one another using an interactive<\/a> heatmap.<\/p>\n

    import plotly.graph_objs as go\r\nimport plotly.offline as py\r\npy.init_notebook_mode(connected=True)\r\n\r\ndata = [\r\n    go.Heatmap(\r\n        z = df_base.iloc[:,1:].corr().values,\r\n        x = df_base.iloc[:,1:].columns.values,\r\n        y = df_base.iloc[:,1:].columns.values,\r\n        colorscale='Earth')\r\n]\r\n\r\nlayout = go.Layout(\r\n    title ='Correlation of SalePrice',\r\n    xaxis = dict(ticks='outside', nticks=36),\r\n    yaxis = dict(ticks='outside' ),\r\n    width = 800, height = 700)\r\n\r\nfig = go.Figure(data=data, layout=layout)\r\npy.iplot(fig)\r\n<\/pre>\n
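The post does not show how the correlation matrix feeds into the final blend, but one common heuristic (an assumption here, not the author's code) is to give less-correlated submissions a larger weight, since they contribute more diverse information. A hedged sketch with toy data standing in for `df_base`:

```python
import pandas as pd

# Toy merged dataframe; in the kernel this is df_base from the merges above
df_base = pd.DataFrame({
    "Id": [1, 2, 3, 4],
    "SalePrice_0": [100.0, 200.0, 300.0, 400.0],
    "SalePrice_1": [110.0, 190.0, 310.0, 390.0],
    "SalePrice_2": [400.0, 100.0, 250.0, 500.0],
})

preds = df_base.iloc[:, 1:]          # the SalePrice_* columns only
corr = preds.corr()                  # pairwise Pearson correlations

# Weight each submission by how little it correlates with the others:
# a lower average correlation earns a higher weight (a common heuristic,
# not taken from the original kernel)
avg_corr = corr.mean()
weights = 1.0 - avg_corr
weights = weights / weights.sum()    # normalize weights to sum to 1

# Weighted blend of the predictions
blended = preds.mul(weights, axis=1).sum(axis=1)
df_out = pd.DataFrame({"Id": df_base["Id"], "SalePrice": blended})
```

Under this scheme the two near-duplicate submissions split their influence while the more independent one keeps a larger share, which is the intuition behind blending diverse models in the first place.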