{"id":1631,"date":"2019-01-16T20:50:09","date_gmt":"2019-01-16T20:50:09","guid":{"rendered":"https:\/\/www.codeastar.com\/?p=1631"},"modified":"2019-01-17T02:59:31","modified_gmt":"2019-01-17T02:59:31","slug":"get-rich-stock-trading-machine-learning","status":"publish","type":"post","link":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/","title":{"rendered":"Stock Trading with Machine Learning and Get Rich"},"content":{"rendered":"\n<p>Okay, I admit it, it looks like a clickbait headline :]] (yes, we did the <a href=\"https:\/\/www.codeastar.com\/win-big-real-estate-market-data-science\/\">similar thing<\/a> before :]] ). But this is not a clickbait at all, as we are actually discussing this topic this time. There is a Kaggle&#8217;s challenge on <a rel=\"noreferrer noopener\" aria-label=\"predicting stock trading trend (opens in a new tab)\" href=\"https:\/\/www.kaggle.com\/c\/two-sigma-financial-news\" target=\"_blank\">predicting stock trading trend<\/a>, which is a good fit for our topic. So we use this challenge to start our journey to get rich! (it always feels good to use &#8220;encouraging&#8221; line :]] )<\/p>\n\n\n\n<!--more-->\n\n\n\n<h3 class=\"wp-block-heading\">Stock Trading Datasets<\/h3>\n\n\n\n<p>Likes all our previous machine learning projects, we start our journey by getting the related datasets. Then this time, we have encountered a situation. In this stock trading challenge, we can only use APIs and kernel provided by Kaggle, i.e. we can only load the datasets through Kaggle&#8217;s APIs.  For the usage of this specific API, we can take a look on Kaggle&#8217; <a rel=\"noreferrer noopener\" href=\"https:\/\/www.kaggle.com\/dster\/two-sigma-news-official-getting-started-kernel\" target=\"_blank\">stock trading challenge official getting started kernel<\/a>.<\/p>\n\n\n\n<p>Once we have loaded the datasets, &#8220;<em>market_train_df<\/em>&#8221; and &#8220;<em>news_train_df<\/em>&#8220;, with Kaggle&#8217;s API, we can take a look on their content:<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"600\" height=\"197\" data-attachment-id=\"1634\" data-permalink=\"https:\/\/www.codeastar.com\/market_head\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/market_head-e1546351425406.png?fit=600%2C197&amp;ssl=1\" data-orig-size=\"600,197\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"market_head\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;Market training dataframe&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/market_head-e1546351425406.png?fit=300%2C99&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/market_head-e1546351425406.png?fit=600%2C197&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/market_head-e1546351425406.png?resize=600%2C197&#038;ssl=1\" alt=\"Market training dataframe\" class=\"wp-image-1634\"\/><figcaption> <em>market_train_df<\/em><\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"600\" height=\"250\" data-attachment-id=\"1635\" data-permalink=\"https:\/\/www.codeastar.com\/news_head\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/news_head-e1546357421347.png?fit=600%2C250&amp;ssl=1\" data-orig-size=\"600,250\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"news_head\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;news_train_df&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/news_head-e1546357421347.png?fit=300%2C125&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/news_head-e1546357421347.png?fit=600%2C250&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/news_head-e1546357421347.png?resize=600%2C250&#038;ssl=1\" alt=\"News training dataframe\" class=\"wp-image-1635\"\/><figcaption><em>news_train_df<\/em><\/figcaption><\/figure>\n\n\n\n<p>&#8220;<em>market_train_df<\/em>&#8221; is a dataframe that contains market information such as stock code, open price, close price, trading volume, etc. While &#8220;<em>news_train_df<\/em>&#8221; is a dataframe that stores stocks related news information, such as headline, tag, word counts, the probability of rather the news is positive or negative, etc..<\/p>\n\n\n\n<p>Every data science challenge comes with a target to solve. What is the target in this challenge then? Since this challenge is about stock trading, in order to get rich, we have to predict the stock price in future. In this challenge, we are going to predict the probability of a stock going up or down in next 10 days.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">EDA on Stock Trading <\/h3>\n\n\n\n<p>&#8220;A picture is worth a thousand words&#8221;. That is why we always use EDA (Exploratory Data Analysis) to visualize our findings. First, let&#8217;s take a look on our market training dataset. Since there are about 3800 stock codes in 2007 to 2016 date range, we pick 5 stock codes we have mentioned in CodeAStar blog previously for our EDA. So we have: <em>Alphabet<\/em> (<em>Google<\/em>), <em>Amazon<\/em>, <em>Apple<\/em>, <em>eBay<\/em> and <em>Microsoft<\/em>.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">import plotly.graph_objs as go\nimport plotly.offline as py\npy.init_notebook_mode(connected=True)\n\ndata = []\nstock_name_arr = ['Microsoft Corp', 'Amazon.com Inc', 'Alphabet Inc', 'Apple Inc', 'eBay Inc']\nfor stock_name in stock_name_arr:\n    trace = go.Scatter(\n            x = market_train_df.loc[market_train_df['assetName'] == stock_name]['time'].dt.strftime(date_format='%Y-%m-%d').values,\n            y = market_train_df.loc[market_train_df['assetName'] == stock_name]['close'].values,\n            name=stock_name\n        )\n    data.append(trace)\n\nlayout = go.Layout(\n                title = \"Stocks Price Chart\",\n                xaxis=dict(\n                  title='Date',\n                  rangeslider=dict(visible=True),\n                  type='date'\n                ),\n                yaxis=dict(\n                  title=\"Price (USD)\",\n                  type='log',\n                  autorange=True\n                )               \n             )\nfig = go.Figure(data=data, layout=layout)\npy.iplot(fig)\n<\/pre>\n\n\n\n<p>Here we go:<\/p>\n\n\n\n<iframe loading=\"lazy\" width=\"800\" height=\"500\" frameborder=\"0\" scrolling=\"no\" src=\"\/\/plot.ly\/~codeastar\/19.embed\"><\/iframe>\n\n\n\n<p>You may see something strange there on the chart, yes, the stock prices of Apple and eBay <strong>dropped vertically<\/strong> on 2 occasions.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"275\" height=\"218\" data-attachment-id=\"1644\" data-permalink=\"https:\/\/www.codeastar.com\/shock\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/shock.gif?fit=275%2C218&amp;ssl=1\" data-orig-size=\"275,218\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"shock\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/shock.gif?fit=275%2C218&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/shock.gif?fit=275%2C218&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/shock.gif?resize=275%2C218&#038;ssl=1\" alt=\"shocked\" class=\"wp-image-1644\"\/><\/figure><\/div>\n\n\n\n<p>Don&#8217;t panic. The stock data is fine without any error. As there were <a rel=\"noreferrer noopener\" aria-label=\"stock splits (opens in a new tab)\" href=\"https:\/\/www.investopedia.com\/ask\/answers\/what-stock-split-why-do-stocks-split\/\" target=\"_blank\">stock splits<\/a> happened on both Apple and eBay stocks. The basic idea of stock split is, a corporation deciding to increase its total number of shares outstanding without altering the current market value.<\/p>\n\n\n\n<p>We can go to <a rel=\"noreferrer noopener\" aria-label=\"Yahoo Finance (opens in a new tab)\" href=\"https:\/\/finance.yahoo.com\/\" target=\"_blank\">Yahoo Finance<\/a> to check stock split history of stocks. <\/p>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"500\" height=\"243\" data-attachment-id=\"1647\" data-permalink=\"https:\/\/www.codeastar.com\/applesh\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/applesh-e1546834732880.png?fit=500%2C243&amp;ssl=1\" data-orig-size=\"500,243\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"applesh\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;Apple&#8217; stock split&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/applesh-e1546834732880.png?fit=300%2C146&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/applesh-e1546834732880.png?fit=500%2C243&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/applesh-e1546834732880.png?resize=500%2C243&#038;ssl=1\" alt=\"Apple' stock split\" class=\"wp-image-1647\"\/><figcaption>Apple&#8217; stock split<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"500\" height=\"244\" data-attachment-id=\"1646\" data-permalink=\"https:\/\/www.codeastar.com\/ebaysh\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/ebaysh-e1546834672982.png?fit=500%2C244&amp;ssl=1\" data-orig-size=\"500,244\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ebaysh\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;eBay&#8217; stock split&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/ebaysh-e1546834672982.png?fit=300%2C146&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/ebaysh-e1546834672982.png?fit=500%2C244&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/ebaysh-e1546834672982.png?resize=500%2C244&#038;ssl=1\" alt=\"eBay' stock split\" class=\"wp-image-1646\"\/><figcaption>eBay&#8217; stock split<\/figcaption><\/figure>\n\n\n\n<p>Luckily, the training dataset has already included the stock spilt data. Let&#8217;s use Apple&#8217; stock price as our example:<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">market_train_df.loc[(market_train_df['time'] &gt; '2014-06-05') &amp;  (market_train_df['assetCode'] == 'AAPL.O')]<\/pre>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" decoding=\"async\" data-attachment-id=\"1649\" data-permalink=\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/applesh2\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/applesh2-e1546837263297.png?fit=600%2C110&amp;ssl=1\" data-orig-size=\"600,110\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"applesh2\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/applesh2-e1546837263297.png?fit=300%2C55&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/applesh2-e1546837263297.png?fit=600%2C110&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/applesh2.png?ssl=1\" alt=\"Beware abnormal data\" class=\"wp-image-1649\"\/><\/figure>\n\n\n\n<p>We can see the increase of volume and the drop of stock price within the dataset. So we <strong>do not need<\/strong> to do extra work for price adjustment in model training. But <strong>for visualization purpose<\/strong>, we can adjust the stock price according to its stock split ratio:<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\" escaped=\"true\">apple_adjusted =  market_train_df.loc[market_train_df['assetName'] == 'Apple Inc']\nebay_adjusted =  market_train_df.loc[market_train_df['assetName'] == 'eBay Inc']\napple_adjusted.loc[(apple_adjusted['time'] < '2014-06-09'), 'close'] = apple_adjusted.loc[apple_adjusted['time'] < '2014-06-09']['close']\/7\nebay_adjusted.loc[(ebay_adjusted['time'] < '2015-07-20'), 'close'] = ebay_adjusted.loc[ebay_adjusted['time'] < '2015-07-20']['close']\/2376*1000\n<\/pre>\n\n\n\n<p>And we have the updated chart:<\/p>\n\n\n\n<iframe loading=\"lazy\" width=\"800\" height=\"500\" frameborder=\"0\" scrolling=\"no\" src=\"\/\/plot.ly\/~codeastar\/21.embed\"><\/iframe>\n\n\n\n<p>It lets us know that stock price and volume are correlated in our dataset. <\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Find Outliers<\/h3>\n\n\n\n<p>Since we have ~3800 stocks in the 10 years dataset, there may be some extraordinary data. It is good to have a data integrity check. First, let's find stocks with more than 300% change in price within the same day. <\/p>\n\n\n\n<pre lang=\"python\" line=\"1\" escaped=\"true\">market_train_df['price_diff'] = (market_train_df['close']market_train_df['open'])\/market_train_df['open']\nmarket_train_df.loc[(market_train_df['price_diff'] > 3) | (market_train_df['price_diff'] < -0.75)]\n<\/pre>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" decoding=\"async\" data-attachment-id=\"1655\" data-permalink=\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/outlier\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/outlier-e1546920315423.png?fit=500%2C68&amp;ssl=1\" data-orig-size=\"500,68\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"outlier\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/outlier-e1546920315423.png?fit=300%2C41&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/outlier-e1546920315423.png?fit=500%2C68&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/outlier.png?ssl=1\" alt=\"outlier\" class=\"wp-image-1655\"\/><\/figure>\n\n\n\n<p>Then we find a stock with 0.01 opening price in one day and 999.99 opening price in another day. It should be some kind of data error. <\/p>\n\n\n\n<p>We do the same checking on <em>returnsClosePrevRaw1<\/em> and <em>returnsOpenPrevRaw1<\/em>, where <em>returnsClosePrevRaw1<\/em> and <em>returnsOpenPrevRaw1<\/em> are values storing the open or close price change ratio from previous day.<\/p>\n\n\n\n<pre class=\"wp-block-code\"><code>returnsClosePrevRaw1 = (Current Day Close Price - Previous Day Close Price) \/ Previous Day Close Price\nreturnsOpenPrevRaw1 = (Current Day Open Price - Previous Day Open Price) \/ Previous Day Open Price<\/code><\/pre>\n\n\n\n<p>And we remove those outliers from our dataset:<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\" escaped=\"true\">market_train_df= market_train_df.loc[market_train_df['returnsOpenPrevRaw1'].abs() <= 3] \nmarket_train_df= market_train_df.loc[market_train_df['returnsClosePrevRaw1'].abs() <= 3]\nmarket_train_df = market_train_df.loc[market_train_df['close']\/market_train_df['open'] <= 3]\n<\/pre>\n\n\n\n<p>We also remove data earlier than 2010-01-01. This action is based on 2 reasons: <\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>to reduce memory usage as we can only use Kaggle's 17GB kernel in this challenge<\/li><li>to skip the data dated in the 2008 global financial crisis period<\/li><\/ol>\n\n\n\n<pre lang=\"python\" line=\"1\" escaped=\"true\">news_train_df = news_train_df.loc[news_train_df['time'] &gt;= '2010-01-01 22:00:00+0000']\n<\/pre>\n\n\n\n<p>On <em>news_train_df<\/em> dataframe, let' see rather we can find something useful there.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">headtag_df  = news_train_df.groupby(['headlineTag']).size().to_frame('count').reset_index().sort_values('count', ascending=False)\ntrace = go.Bar(\n        x = headtag_df['headlineTag'],\n        y = headtag_df['count']\n    )  \nlayout = go.Layout(\n            title = \"Headline Tag Count\",\n            xaxis=dict(\n              title=\"Tag\",             \n              tickangle=45,\n            ),\n            yaxis=dict(\n              title=\"Count\"\n            ), \n         )\ndata = [trace]\nfig = go.Figure(data=data, layout=layout)\npy.iplot(fig)\n<\/pre>\n\n\n\n<iframe loading=\"lazy\" width=\"800\" height=\"500\" frameborder=\"0\" scrolling=\"no\" src=\"\/\/plot.ly\/~codeastar\/23.embed\"><\/iframe>\n\n\n\n<p>Most of the news has no headline tag and even existing tags are widely distributed, hardly provide important information to us. We should remove this feature when we build our machine learning model.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Build the Model<\/h3>\n\n\n\n<p>Since we are going to predict the stock trend, the objective of our model is to find the confidence of a stock going up or down in next 10 days (i.e. the <em>returnsOpenNextMktres10<\/em> field). Before we start to build our model, first and foremost, we need to merge 2 training datasets \"<em>market_train_df<\/em>\" and \"<em>news_train_df<\/em>\", into one.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">def mergeDF(market_df, news_df):\n    market_df['time'] = market_df.time.dt.date\n    market_df['returnsOpenPrevRaw1_to_volume'] = market_df['returnsOpenPrevRaw1'] \/ market_df['volume']\n    market_df['close_to_open'] = market_df['close'] \/ market_df['open']\n    market_df['average_price'] = (market_df['close'] + market_df['open'])\/2\n    market_df['close_price_volume'] = market_df['volume'] * market_df['close']\n    \n    news_df['firstCreated'] = news_df.firstCreated.dt.date\n    news_df['time'] = news_df.time.dt.hour  \n    news_df['sentence_word_count'] =  news_df['wordCount'] \/ news_df['sentenceCount']\n    news_df['assetCodesLen'] =  news_df['assetCodes'].map(lambda x: len(eval(x)))\n    news_df['assetCode_0'] = news_df['assetCodes'].map(lambda x: list(eval(x))[0])\n    news_df['headlineLen'] = news_df['headline'].apply(lambda x: len(x))\n    \n    news_df = news_df.groupby(['firstCreated', 'assetCode_0'], as_index=False).mean()\n   \n    #merge market with news df\n    market_df = pd.merge(market_df, news_df, how='left', left_on=['time', 'assetCode'], \n                            right_on=['firstCreated', 'assetCode_0'])   #use left join, i.e. all depend on market df\n    \n    return market_df\n\nmerged_train_df = mergeDF(market_train_df, news_train_df)\n<\/pre>\n\n\n\n<p>Our new dataset, <em>merged_train_df<\/em>, is grouped by the same date and the same stock code (<em>assetCode<\/em>).  Since price is an important feature, we add new price features like <em>average_price<\/em> and <em>close_price_volume<\/em>.<\/p>\n\n\n\n<p>After that we can start creating our training and validating datasets from our merged one. Remember, we are predicting stock prices' future trend, not their future value.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\" escaped=\"true\">selected_cols = ]\n\ntrain_x = market_train_df[selected_cols].values\ntrain_y = market_train_df.returnsOpenNextMktres10 &gt;= 0\ntrain_y = train_y.values\n\nfrom sklearn import model_selection\nX_train, X_test, Y_train, Y_test = model_selection.train_test_split(train_x, train_y, test_size=0.2, random_state=3)\n<\/pre>\n\n\n\n<p>Now we have training and validating datasets, it is time to make our training model. We can use basic parameters for our first model. But since we are predicting the price's up or down trend, we have to set our model objective to \"binary\". <\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">import lightgbm as lgb\nparams = {'learning_rate': 0.01, \n          'max_depth': 12, \n          'boosting': 'gbdt', \n          'objective': 'binary', \n          'metric': 'auc', \n          'seed': 33}\n\nmodel = lgb.train(params, train_set=lgb.Dataset(X_train, label=Y_train), num_boost_round=5000,\n                  valid_sets=[lgb.Dataset(X_train, label=Y_train), lgb.Dataset(X_test, label=Y_test)],\n                  verbose_eval=100, early_stopping_rounds=100)\n<\/pre>\n\n\n\n<p>After that we can sit and wait for our first training outcome.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Predict the Trend<\/h3>\n\n\n\n<p>Before we do the prediction, let's take a look on feature importance chart from our model.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">impo_df = pd.DataFrame({'imp': model.feature_importance(), 'col':selected_cols})\nimpo_df = impo_df.sort_values(['imp','col'], ascending=[True, False])\ncolors=[]\nfor i in range(len(selected_cols)):   \n   if i % 2 == 0:\n    colors.append('#EFC62E')\n   else:\n    colors.append('#EF7D2E')\ndata=[]\ntrace = go.Bar(\n            orientation = 'h',\n            x = impo_df.imp,\n            y = impo_df.col,\n            marker=dict(\n              color= colors,\n            )\n        )\ndata.append(trace)\nlayout = go.Layout(\n                title = \"Feature Importance Chart\",\n                titlefont=dict(size=25),\n                xaxis=dict(\n                  title='Importance',   \n                  titlefont=dict(size=20),\n                ),\n                yaxis=dict(\n                  title=\"Feature\",\n                  tickangle=45,\n                  automargin=True,\n                  titlefont=dict(size=20),\n                )               \n             )\nfig = go.Figure(data=data, layout=layout)\npy.iplot(fig)\n<\/pre>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"700\" height=\"450\" data-attachment-id=\"1670\" data-permalink=\"https:\/\/www.codeastar.com\/newplot\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/newplot.png?fit=700%2C450&amp;ssl=1\" data-orig-size=\"700,450\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"newplot\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/newplot.png?fit=300%2C193&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/newplot.png?fit=700%2C450&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/newplot.png?resize=700%2C450&#038;ssl=1\" alt=\"Stock trading model Importance Chart\" class=\"wp-image-1670\" srcset=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/newplot.png?w=700&amp;ssl=1 700w, https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/newplot.png?resize=300%2C193&amp;ssl=1 300w\" sizes=\"auto, (max-width: 700px) 100vw, 700px\" \/><\/figure>\n\n\n\n<p>From the chart, we find that marketing features score higher in importance than news features.<\/p>\n\n\n\n<p>We get the testing dataset using Kaggle's specified API, <em>env.get_prediction_days()<\/em>. Then we can predict the outcome in a date loop and submit to Kaggle.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">days = env.get_prediction_days()\n\nstart_time = time.time()\nfor market_test_df, news_test_df, pred_template_df in days:\n    market_test_df = mergeDF(market_test_df, news_test_df)\n    #fill up with zero values\n    X_live = market_test_df[selected_cols].values\n    predictions = model.predict(X_live, num_iteration=model.best_iteration)\n    confidence = 2 * predictions -1    \n    preds_df = pd.DataFrame({'assetCode':market_test_df['assetCode'],'confidence':confidence})\n    pred_template_df['confidenceValue'][pred_template_df['assetCode'].isin(preds_df.assetCode)] = preds_df['confidence'].values\n    env.predict(pred_template_df)\n\nprint(\"Time used for prediction: {} seconds\".format(time.time()-start_time)) \nenv.write_submission_file()\n<\/pre>\n\n\n\n<h3 class=\"wp-block-heading\">Conclusion<\/h3>\n\n\n\n<p>Finally, our current model can score about 0.647 (mean divided by the standard deviation of daily&nbsp;confidence, the closer to 1 the better). It is definitely not helping us to get rich in stock market :]] . <strong>But<\/strong>, the main point of this exercise is, to learn the know-how on building our stock trading prediction model. So we summarize what we have experienced:<\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>market data is more important than news data<\/li><li>price related features are important<\/li><li>there may be outliers in training data <\/li><li>there are global events which affect all stock prices in general<\/li><\/ul>\n\n\n\n<p>I would suggest we focus on price and volume features and apply certain day range patterns, in order to make good our model and get better result next time.<\/p>\n\n\n\n<div style=\"height:135px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">What have we learnt in this post?<\/h3>\n\n\n\n<ol class=\"wp-block-list\"><li>Feature engineering for stock trading datasets<\/li><li>Data integrity on stock training datasets<\/li><li>Handling on market news<\/li><li>Stock prediction result criteria<\/li><\/ol>\n","protected":false},"excerpt":{"rendered":"<p>Okay, I admit it, it looks like a clickbait headline :]] (yes, we did the similar thing before :]] ). But this is not a clickbait at all, as we are actually discussing this topic this time. There is a Kaggle&#8217;s challenge on predicting stock trading trend, which is a good fit for our topic. [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":1691,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"om_disable_all_campaigns":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"site-sidebar-layout":"default","site-content-layout":"default","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"default","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false},"version":2}},"categories":[18],"tags":[127,82,22,126,125,128],"class_list":["post-1631","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-machine-learning","tag-feature-engineering","tag-lgb","tag-machine-learning","tag-stock","tag-stock-market","tag-stock-trading"],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v24.9 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Stock Trading with Machine Learning and Get Rich &#8902; Code A Star<\/title>\n<meta name=\"description\" content=\"In this exercise, we use Kaggle&#039; stock trading prediction challenge datasets to make our stock trend machine learning model with Python and LGB.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Stock Trading with Machine Learning and Get Rich &#8902; Code A Star\" \/>\n<meta property=\"og:description\" content=\"In this exercise, we use Kaggle&#039; stock trading prediction challenge datasets to make our stock trend machine learning model with Python and LGB.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/\" \/>\n<meta property=\"og:site_name\" content=\"Code A Star\" \/>\n<meta property=\"article:publisher\" content=\"codeastar\" \/>\n<meta property=\"article:author\" content=\"codeastar\" \/>\n<meta property=\"article:published_time\" content=\"2019-01-16T20:50:09+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2019-01-17T02:59:31+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"800\" \/>\n\t<meta property=\"og:image:height\" content=\"432\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Raven Hon\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@codeastar\" \/>\n<meta name=\"twitter:site\" content=\"@codeastar\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Raven Hon\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/\"},\"author\":{\"name\":\"Raven Hon\",\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd\"},\"headline\":\"Stock Trading with Machine Learning and Get Rich\",\"datePublished\":\"2019-01-16T20:50:09+00:00\",\"dateModified\":\"2019-01-17T02:59:31+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/\"},\"wordCount\":1113,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd\"},\"image\":{\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png?fit=800%2C432&ssl=1\",\"keywords\":[\"feature engineering\",\"LGB\",\"Machine Learning\",\"stock\",\"stock market\",\"stock trading\"],\"articleSection\":[\"Learn Machine Learning\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/\",\"url\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/\",\"name\":\"Stock Trading with Machine Learning and Get Rich &#8902; Code A Star\",\"isPartOf\":{\"@id\":\"https:\/\/www.codeastar.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png?fit=800%2C432&ssl=1\",\"datePublished\":\"2019-01-16T20:50:09+00:00\",\"dateModified\":\"2019-01-17T02:59:31+00:00\",\"description\":\"In this exercise, we use Kaggle' stock trading prediction challenge datasets to make our stock trend machine learning model with Python and LGB.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#primaryimage\",\"url\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png?fit=800%2C432&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png?fit=800%2C432&ssl=1\",\"width\":800,\"height\":432,\"caption\":\"Stock Trading with Machine Learning\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.codeastar.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Stock Trading with Machine Learning and Get Rich\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.codeastar.com\/#website\",\"url\":\"https:\/\/www.codeastar.com\/\",\"name\":\"Code A Star\",\"description\":\"We don&#039;t wish upon a star, we code a star\",\"publisher\":{\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.codeastar.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd\",\"name\":\"Raven Hon\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2018\/08\/logo70.png?fit=70%2C70&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2018\/08\/logo70.png?fit=70%2C70&ssl=1\",\"width\":70,\"height\":70,\"caption\":\"Raven Hon\"},\"logo\":{\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/image\/\"},\"description\":\"Raven Hon is\u00a0a 20 years+ veteran in information technology industry who has worked on various projects from console, web, game, banking and mobile applications in different sized companies.\",\"sameAs\":[\"https:\/\/www.codeastar.com\",\"codeastar\",\"https:\/\/x.com\/codeastar\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Stock Trading with Machine Learning and Get Rich &#8902; Code A Star","description":"In this exercise, we use Kaggle' stock trading prediction challenge datasets to make our stock trend machine learning model with Python and LGB.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/","og_locale":"en_US","og_type":"article","og_title":"Stock Trading with Machine Learning and Get Rich &#8902; Code A Star","og_description":"In this exercise, we use Kaggle' stock trading prediction challenge datasets to make our stock trend machine learning model with Python and LGB.","og_url":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/","og_site_name":"Code A Star","article_publisher":"codeastar","article_author":"codeastar","article_published_time":"2019-01-16T20:50:09+00:00","article_modified_time":"2019-01-17T02:59:31+00:00","og_image":[{"width":800,"height":432,"url":"https:\/\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png","type":"image\/png"}],"author":"Raven Hon","twitter_card":"summary_large_image","twitter_creator":"@codeastar","twitter_site":"@codeastar","twitter_misc":{"Written by":"Raven Hon","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#article","isPartOf":{"@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/"},"author":{"name":"Raven Hon","@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd"},"headline":"Stock Trading with Machine Learning and Get Rich","datePublished":"2019-01-16T20:50:09+00:00","dateModified":"2019-01-17T02:59:31+00:00","mainEntityOfPage":{"@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/"},"wordCount":1113,"commentCount":0,"publisher":{"@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd"},"image":{"@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png?fit=800%2C432&ssl=1","keywords":["feature engineering","LGB","Machine Learning","stock","stock market","stock trading"],"articleSection":["Learn Machine Learning"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/","url":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/","name":"Stock Trading with Machine Learning and Get Rich &#8902; Code A Star","isPartOf":{"@id":"https:\/\/www.codeastar.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#primaryimage"},"image":{"@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png?fit=800%2C432&ssl=1","datePublished":"2019-01-16T20:50:09+00:00","dateModified":"2019-01-17T02:59:31+00:00","description":"In this exercise, we use Kaggle' stock trading prediction challenge datasets to make our stock trend machine learning model with Python and LGB.","breadcrumb":{"@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#primaryimage","url":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png?fit=800%2C432&ssl=1","contentUrl":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png?fit=800%2C432&ssl=1","width":800,"height":432,"caption":"Stock Trading with Machine Learning"},{"@type":"BreadcrumbList","@id":"https:\/\/www.codeastar.com\/get-rich-stock-trading-machine-learning\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.codeastar.com\/"},{"@type":"ListItem","position":2,"name":"Stock Trading with Machine Learning and Get Rich"}]},{"@type":"WebSite","@id":"https:\/\/www.codeastar.com\/#website","url":"https:\/\/www.codeastar.com\/","name":"Code A Star","description":"We don&#039;t wish upon a star, we code a star","publisher":{"@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.codeastar.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":["Person","Organization"],"@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd","name":"Raven Hon","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/image\/","url":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2018\/08\/logo70.png?fit=70%2C70&ssl=1","contentUrl":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2018\/08\/logo70.png?fit=70%2C70&ssl=1","width":70,"height":70,"caption":"Raven Hon"},"logo":{"@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/image\/"},"description":"Raven Hon is\u00a0a 20 years+ veteran in information technology industry who has worked on various projects from console, web, game, banking and mobile applications in different sized companies.","sameAs":["https:\/\/www.codeastar.com","codeastar","https:\/\/x.com\/codeastar"]}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/01\/stock1.png?fit=800%2C432&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p8PcRO-qj","jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/posts\/1631","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/comments?post=1631"}],"version-history":[{"count":42,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/posts\/1631\/revisions"}],"predecessor-version":[{"id":1692,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/posts\/1631\/revisions\/1692"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/media\/1691"}],"wp:attachment":[{"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/media?parent=1631"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/categories?post=1631"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/tags?post=1631"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}