{"id":1941,"date":"2019-05-15T19:27:29","date_gmt":"2019-05-15T19:27:29","guid":{"rendered":"https:\/\/www.codeastar.com\/?p=1941"},"modified":"2019-05-15T19:27:42","modified_gmt":"2019-05-15T19:27:42","slug":"recurrent-neural-network-rnn-in-nlp-and-python-part-2","status":"publish","type":"post","link":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/","title":{"rendered":"RNN (Recurrent Neural Network) in NLP and Python &#8211; Part 2"},"content":{"rendered":"\n<p>From our <a href=\"https:\/\/www.codeastar.com\/word-embedding-in-nlp-and-python-part-1\/\">Part 1<\/a> of NLP and Python topic, we talked about word pre-processing for a machine to handle words. This time, we are going to talk about building a model for a machine to classify words. We learned to <a href=\"https:\/\/www.codeastar.com\/convolutional-neural-network-python\/\">use CNN to classify images<\/a> in past. Then we use another neural network, Recurrent Neural Network (RNN), to classify words now.<\/p>\n\n\n\n<!--more-->\n\n\n\n<h3 class=\"wp-block-heading\">What is Recurrent Neural Network (RNN)? <\/h3>\n\n\n\n<p>RNN is a class of deep neural networks and so is the CNN. Then what is the major difference between CNN and RNN? The spelling. (okay, don&#8217;t laugh, I&#8217;m serious :]] ) The &#8220;R&#8221; of RNN stands for Recurrent. It means process is occupied repeatedly and this is the feature we don&#8217;t see in CNN.<\/p>\n\n\n\n<p>In CNN, we call it a feed-forward network. While the input of layer 2 is the output of layer 1, the input of layer 3 is the output of layer 2 and the list goes on. But in RNN, things go repeatedly, as the inputs of layer 2 <strong>are<\/strong> the output of layer 1 <strong>and<\/strong> also the output of layer 2. A RNN not only produces output, it can copy and loop it back to the network. It turns out RNN is a neural network with memory.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"530\" height=\"298\" data-attachment-id=\"1945\" data-permalink=\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/30chvx\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/30chvx-e1557285255637.jpg?fit=530%2C298&amp;ssl=1\" data-orig-size=\"530,298\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"30chvx\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/30chvx-e1557285255637.jpg?fit=300%2C169&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/30chvx-e1557285255637.jpg?fit=530%2C298&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/30chvx-e1557285255637.jpg?resize=530%2C298&#038;ssl=1\" alt=\"RNN, neural network with memory\" class=\"wp-image-1945\"\/><\/figure><\/div>\n\n\n\n<p>It also makes RNN strong on handling sequence of data to predict precise outcome. The content of sequential data, e.g. speech, depends on how data is connected. When we have &#8220;Have&#8221;, &#8220;a&#8221; and &#8220;nice&#8221; as inputs, RNN remembers those inputs and predict &#8220;day&#8221; as output. That is also why RNN is widely used on text recognition and translation applications. We can explain a RNN with following diagram:<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"269\" data-attachment-id=\"1950\" data-permalink=\"https:\/\/www.codeastar.com\/rnn-unrolled\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/RNN-unrolled-e1557291926172.png?fit=800%2C210&amp;ssl=1\" data-orig-size=\"800,210\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"RNN-unrolled\" data-image-description=\"\" data-image-caption=\"&lt;p&gt;(image source: http:\/\/colah.github.io\/posts\/2015-08-Understanding-LSTMs\/)&lt;\/p&gt;\n\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/RNN-unrolled-e1557291926172.png?fit=300%2C79&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/RNN-unrolled-e1557291926172.png?fit=1024%2C269&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/RNN-unrolled.png?resize=1024%2C269&#038;ssl=1\" alt=\"Inside a RNN layer\" class=\"wp-image-1950\"\/><figcaption>(image source: <a href=\"http:\/\/colah.github.io\/posts\/2015-08-Understanding-LSTMs\/\" target=\"_blank\" rel=\"noreferrer noopener\" aria-label=\"http:\/\/colah.github.io\/posts\/2015-08-Understanding-LSTMs\/ (opens in a new tab)\">http:\/\/colah.github.io\/posts\/2015-08-Understanding-LSTMs\/<\/a>)<\/figcaption><\/figure><\/div>\n\n\n\n<p>First, an input, <em>X_t,<\/em> passes through RNN, <em>A<\/em>. It starts from the first round. We call the first chunk of input as <em>X_0<\/em>. RNN then produces hidden output <em>h_0<\/em>.  Then we go for the next round with input <em>X_1<\/em>, <em>h_0<\/em> is added to the RNN, and we have hidden output <em>h_1<\/em>. The flow goes again and again until we put all our input into <em>A<\/em>. Finally, we have <em>h_t<\/em> as our output which trains with previous inputs. <\/p>\n\n\n\n<h3 class=\"wp-block-heading\">RNN in Python<\/h3>\n\n\n\n<p>From our <a rel=\"noreferrer noopener\" href=\"https:\/\/www.codeastar.com\/convolutional-neural-network-python\/\">Python Image Recognizer<\/a> post, we built a CNN model for image classification with <a rel=\"noreferrer noopener\" aria-label=\"Keras (opens in a new tab)\" href=\"https:\/\/keras.io\/\" target=\"_blank\">Keras<\/a>. This time, we are going to use the Keras library again, but for a RNN model. Firstly, let&#8217;s import required modules.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">from keras.preprocessing.text import Tokenizer\nfrom keras.preprocessing.sequence import pad_sequences\nfrom keras.layers import Embedding, Input, Dense, CuDNNLSTM, concatenate, Bidirectional, SpatialDropout1D, Conv1D, GlobalAveragePooling1D, GlobalMaxPooling1D\nfrom keras.optimizers import Adam\nfrom keras.models import Model\nfrom keras.callbacks import EarlyStopping, ModelCheckpoint\nimport keras.backend as K\nfrom sklearn.model_selection import KFold\n<\/pre>\n\n\n\n<p>Then we apply the word pre-processing function, <em>punct_apo_fix<\/em>, from Part 1 and pre-process our training and testing data. <\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">df_merge = pd.concat([df_train[['id','comment_text']], df_test], axis=0)\ndf_merge[\"comment_text\"] = df_merge[\"comment_text\"].apply(lambda x: punct_apo_fix(x))\ndf_train_comment = df_merge.iloc[:df_train.shape[0],:]\ndf_test_comment = df_merge.iloc[df_train.shape[0]:,:]\ndf_train_comment = pd.concat([df_train_comment,df_train[['target']]],axis=1)\n<\/pre>\n\n\n\n<p>Now we normalize our training data and get the word vector indexes from the pre-trained fastText model.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\" escaped=\"true\">df_train_comment['target'] = np.where(df_train_comment['target'] &gt;= 0.5, True, False)\ntokenizer = Tokenizer(num_words=100000)\ntokenizer.fit_on_texts(list(df_train_comment['comment_text']) + list(df_test_comment['comment_text']))\ntotal_unique_word = len(tokenizer.word_index) + 1\nwordvectors_index = KeyedVectors.load_word2vec_format(fasttext_300d_2m_model)\n<\/pre>\n\n\n\n<p>Next, we apply the fastText word vector indexes into words found from our training and testing data.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">EMBEDDINGS_DIMENSION = 300\nembedding_matrix = np.zeros((total_unique_word,EMBEDDINGS_DIMENSION))\nfor word, i in tokenizer.word_index.items():\n    if word in wordvectors_index.vocab:\n        embedding_matrix[i] = wordvectors_index[word]\n<\/pre>\n\n\n\n<p>Okay, we are going to the fun part of this project &#8212; build the model!<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">RNN model with LSTM and Bidirectional Structure<\/h3>\n\n\n\n<p>Before we start to build our model, there are 2 techniques we can apply on RNN to make good the model. They are <a rel=\"noreferrer noopener\" aria-label=\"Long Short-Term Memory (opens in a new tab)\" href=\"https:\/\/en.wikipedia.org\/wiki\/Long_short-term_memory\" target=\"_blank\">Long Short-Term Memory<\/a> (LSTM) and <a rel=\"noreferrer noopener\" aria-label=\"Bidirectional (opens in a new tab)\" href=\"https:\/\/en.wikipedia.org\/wiki\/Bidirectional_recurrent_neural_networks\" target=\"_blank\">Bidirectional<\/a> RNN. Let&#8217; start with LSTM first.<\/p>\n\n\n\n<p>When our input data is &#8220;<em>People in Japan speak&#8230;<\/em> &#8220;. We can expect the output should be &#8220;<em>Japanese<\/em>&#8221; from our human&#8217;s mind. But from a machine&#8217;s perspective, it does not have enough information to generate the output. It needs the relationship of &#8220;Japan&#8221; and &#8220;Japanese&#8221; from other inputs. That is why we need LSTM to extend RNN memory for not only current input but also previous inputs. <\/p>\n\n\n\n<p>Then we go for the Bidirectional RNN. The concept of Bidirectional structure is straight-forward. It duplicates a recurrent layer but in reverse order. When we have &#8220;<em>Have a nice day<\/em>.&#8221; as input, it will turn out becoming 2 layers with &#8220;<em>Have<\/em>&#8220;, &#8220;<em>a<\/em>&#8220;, &#8220;<em>nice<\/em>&#8220;, &#8220;<em>day<\/em>&#8220;, &#8220;<em>.<\/em>&#8221; and &#8220;<em>.<\/em>&#8220;, &#8220;<em>day<\/em>&#8220;, &#8220;<em>nice<\/em>&#8220;, &#8220;<em>a<\/em>&#8220;, &#8220;<em>Have<\/em>&#8220;. So what is the benefit of using Bidirectional structure? Let&#8217;s think about &#8220;<em>We go to a __________ to have lunch there<\/em>&#8220;. The RNN reads &#8220;<em>We<\/em>&#8220;, &#8220;<em>go<\/em>&#8221; &#8220;<em>to<\/em>&#8221; forwardly and &#8220;<em>there<\/em>&#8220;, &#8220;<em>lunch<\/em>&#8220;, &#8220;<em>have<\/em>&#8221; backwardly. Then it can predict &#8220;<em>restaurant<\/em>&#8221; by relating inputs from the 2 layers.<\/p>\n\n\n\n<p>Now we put those techniques into our model:<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">MAX_SEQUENCE_LENGTH = 256\ndef build_model(total_unique_word, embedding_matrix):\n    sequence_input = Input(shape=(MAX_SEQUENCE_LENGTH,), dtype='int32')\n    embedding_layer = Embedding(total_unique_word,\n                            EMBEDDINGS_DIMENSION,\n                            weights=[embedding_matrix],\n                            input_length=MAX_SEQUENCE_LENGTH,\n                            trainable=False)\n    x = embedding_layer(sequence_input)\n    x = SpatialDropout1D(0.2)(x)\n    x = Bidirectional(CuDNNLSTM(64, return_sequences=True))(x)   \n    x = Conv1D(64, kernel_size = 2, padding = \"valid\", kernel_initializer = \"he_uniform\")(x)\n    avg_pool1 = GlobalAveragePooling1D()(x)\n    max_pool1 = GlobalMaxPooling1D()(x)     \n    x = concatenate([avg_pool1, max_pool1])\n    preds = Dense(1, activation='sigmoid')(x)\n    model = Model(sequence_input, preds)\n    model.compile(loss='binary_crossentropy',\n              optimizer=Adam(),\n              metrics=['acc'])\n    return model\n<\/pre>\n\n\n\n<p>As we mentioned in Part 1, a machine handles words using word vectors. That is why we apply <em>Embedding<\/em> layer to convert input to vector. And you may notice that, instead of using <em>Droupout <\/em>layer, we use <em>SpatialDropout1D<\/em>. It will drop entire 1D feature maps, and make bigger difference for machine to learn.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Predict Comment Classification<\/h3>\n\n\n\n<p>Data pre-processing, check. Model building, check. Now it is the time we go for predicting those comments are toxic or not. We also apply 3-fold <a href=\"https:\/\/www.codeastar.com\/choose-machine-learning-models-python\/\">cross validation<\/a> to enhance our prediction.<\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">train_text = pad_sequences(tokenizer.texts_to_sequences(df_train_comment[\"comment\"]), maxlen=MAX_SEQUENCE_LENGTH)\ntest_text = pad_sequences(tokenizer.texts_to_sequences(df_test_comment[\"comment\"]), maxlen=MAX_SEQUENCE_LENGTH)\ntrain_target = df_train_comment[\"target\"]\nn_splits=3\nsplits = list(KFold(n_splits).split(train_text,train_target))\ntest_preds = np.zeros((df_test_comment.shape[0]))\nfor fold in list(range(n_splits)):\n    K.clear_session()\n    tr_ind, val_ind = splits[fold]\n    checkpoint = ModelCheckpoint(f'gru_{fold}.hdf5', save_best_only = True)\n    earlystop = EarlyStopping(monitor='val_loss', mode='min', verbose=1, patience=3)\n    model = build_model()\n    model.fit(train_text[tr_ind],\n        train_target[tr_ind],\n        batch_size=2048,\n        epochs=100,\n        validation_data=(train_text[val_ind], train_target[val_ind]),\n        callbacks = [earlystop,checkpoint])\n    test_preds += model.predict(test_text)[:,0]\ntest_preds \/= n_splits\nsubmission = pd.read_csv('..\/input\/jigsaw-unintended-bias-in-toxicity-classification\/sample_submission.csv', index_col='id')\nsubmission['prediction'] = test_preds\nsubmission.reset_index(drop=False, inplace=True)\n<\/pre>\n\n\n\n<p>After around 2 hours processing time, we have the prediction and we can discover how well a machine can do for a human&#8217;s work. <\/p>\n\n\n\n<pre lang=\"python\" line=\"1\">validation_df = pd.merge(test_df, submission, on='id')\nvalidation_df[validation_df.prediction > 0.5].head()\nvalidation_df[validation_df.prediction < 0.5].head()\n<\/pre>\n\n\n\n<p>This is a group of machine classified toxic comments: <\/p>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"196\" data-attachment-id=\"1968\" data-permalink=\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/comment1\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/comment1-e1557764336664.png?fit=800%2C153&amp;ssl=1\" data-orig-size=\"800,153\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"comment1\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/comment1-e1557764336664.png?fit=300%2C57&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/comment1-e1557764336664.png?fit=1024%2C196&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/comment1.png?resize=1024%2C196&#038;ssl=1\" alt=\"Toxic Comments\" class=\"wp-image-1968\"\/><\/figure>\n\n\n\n<p> And this is a group of non-toxic comments:  <\/p>\n\n\n\n<figure class=\"wp-block-image\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"190\" data-attachment-id=\"1969\" data-permalink=\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/comment2\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/comment2-e1557764369647.png?fit=800%2C148&amp;ssl=1\" data-orig-size=\"800,148\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"comment2\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/comment2-e1557764369647.png?fit=300%2C56&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/comment2-e1557764369647.png?fit=1024%2C190&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/comment2.png?resize=1024%2C190&#038;ssl=1\" alt=\"Non-toxic comments\" class=\"wp-image-1969\"\/><\/figure>\n\n\n\n<p>I do not have any issue for non-toxic comments, just the toxic comments part is a bit, \"high moral standard\" :]] .<\/p>\n\n\n\n<p>When I submit above prediction to Kaggle, it turns out scoring 0.92x , i.e. 92.x% accuracy. There is still room for improvement, keep trying and learning!<\/p>\n\n\n\n<div style=\"height:148px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"> What have we learnt in this post? <\/h3>\n\n\n\n<ol class=\"wp-block-list\"><li>Definition of Recurrent Neural Network<\/li><li>Concept of LSTM<\/li><li>Concept of Bidirectional structure<\/li><li>Building RNN model in Python<\/li><\/ol>\n","protected":false},"excerpt":{"rendered":"<p>From our Part 1 of NLP and Python topic, we talked about word pre-processing for a machine to handle words. This time, we are going to talk about building a model for a machine to classify words. We learned to use CNN to classify images in past. Then we use another neural network, Recurrent Neural [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":1982,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"om_disable_all_campaigns":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"site-sidebar-layout":"default","site-content-layout":"default","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"default","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"Build a RNN (Recurrent Neural Network) in Python","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false},"version":2}},"categories":[18],"tags":[57,26,146,140,145,144],"class_list":["post-1941","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-machine-learning","tag-deep-learning","tag-k-fold-cross-validation","tag-lstm","tag-nlp","tag-recurrent-neural-network","tag-rnn"],"jetpack_publicize_connections":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v24.9 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>RNN (Recurrent Neural Network) in NLP and Python - Part 2 &#8902; Code A Star<\/title>\n<meta name=\"description\" content=\"In this post, we continue our journey in NLP. We will discuss using Recurrent Neural Network (RNN) with Python to classify comments from text source.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"RNN (Recurrent Neural Network) in NLP and Python - Part 2 &#8902; Code A Star\" \/>\n<meta property=\"og:description\" content=\"In this post, we continue our journey in NLP. We will discuss using Recurrent Neural Network (RNN) with Python to classify comments from text source.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/\" \/>\n<meta property=\"og:site_name\" content=\"Code A Star\" \/>\n<meta property=\"article:publisher\" content=\"codeastar\" \/>\n<meta property=\"article:author\" content=\"codeastar\" \/>\n<meta property=\"article:published_time\" content=\"2019-05-15T19:27:29+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2019-05-15T19:27:42+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1000\" \/>\n\t<meta property=\"og:image:height\" content=\"723\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Raven Hon\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@codeastar\" \/>\n<meta name=\"twitter:site\" content=\"@codeastar\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Raven Hon\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/\"},\"author\":{\"name\":\"Raven Hon\",\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd\"},\"headline\":\"RNN (Recurrent Neural Network) in NLP and Python &#8211; Part 2\",\"datePublished\":\"2019-05-15T19:27:29+00:00\",\"dateModified\":\"2019-05-15T19:27:42+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/\"},\"wordCount\":876,\"commentCount\":3,\"publisher\":{\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd\"},\"image\":{\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png?fit=1000%2C723&ssl=1\",\"keywords\":[\"deep learning\",\"k-fold cross validation\",\"LSTM\",\"NLP\",\"Recurrent Neural Network\",\"RNN\"],\"articleSection\":[\"Learn Machine Learning\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/\",\"url\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/\",\"name\":\"RNN (Recurrent Neural Network) in NLP and Python - Part 2 &#8902; Code A Star\",\"isPartOf\":{\"@id\":\"https:\/\/www.codeastar.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png?fit=1000%2C723&ssl=1\",\"datePublished\":\"2019-05-15T19:27:29+00:00\",\"dateModified\":\"2019-05-15T19:27:42+00:00\",\"description\":\"In this post, we continue our journey in NLP. We will discuss using Recurrent Neural Network (RNN) with Python to classify comments from text source.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#primaryimage\",\"url\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png?fit=1000%2C723&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png?fit=1000%2C723&ssl=1\",\"width\":1000,\"height\":723,\"caption\":\"Recurrent Neural Network\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.codeastar.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"RNN (Recurrent Neural Network) in NLP and Python &#8211; Part 2\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.codeastar.com\/#website\",\"url\":\"https:\/\/www.codeastar.com\/\",\"name\":\"Code A Star\",\"description\":\"We don&#039;t wish upon a star, we code a star\",\"publisher\":{\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.codeastar.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd\",\"name\":\"Raven Hon\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2018\/08\/logo70.png?fit=70%2C70&ssl=1\",\"contentUrl\":\"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2018\/08\/logo70.png?fit=70%2C70&ssl=1\",\"width\":70,\"height\":70,\"caption\":\"Raven Hon\"},\"logo\":{\"@id\":\"https:\/\/www.codeastar.com\/#\/schema\/person\/image\/\"},\"description\":\"Raven Hon is\u00a0a 20 years+ veteran in information technology industry who has worked on various projects from console, web, game, banking and mobile applications in different sized companies.\",\"sameAs\":[\"https:\/\/www.codeastar.com\",\"codeastar\",\"https:\/\/x.com\/codeastar\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"RNN (Recurrent Neural Network) in NLP and Python - Part 2 &#8902; Code A Star","description":"In this post, we continue our journey in NLP. We will discuss using Recurrent Neural Network (RNN) with Python to classify comments from text source.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/","og_locale":"en_US","og_type":"article","og_title":"RNN (Recurrent Neural Network) in NLP and Python - Part 2 &#8902; Code A Star","og_description":"In this post, we continue our journey in NLP. We will discuss using Recurrent Neural Network (RNN) with Python to classify comments from text source.","og_url":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/","og_site_name":"Code A Star","article_publisher":"codeastar","article_author":"codeastar","article_published_time":"2019-05-15T19:27:29+00:00","article_modified_time":"2019-05-15T19:27:42+00:00","og_image":[{"width":1000,"height":723,"url":"https:\/\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png","type":"image\/png"}],"author":"Raven Hon","twitter_card":"summary_large_image","twitter_creator":"@codeastar","twitter_site":"@codeastar","twitter_misc":{"Written by":"Raven Hon","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#article","isPartOf":{"@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/"},"author":{"name":"Raven Hon","@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd"},"headline":"RNN (Recurrent Neural Network) in NLP and Python &#8211; Part 2","datePublished":"2019-05-15T19:27:29+00:00","dateModified":"2019-05-15T19:27:42+00:00","mainEntityOfPage":{"@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/"},"wordCount":876,"commentCount":3,"publisher":{"@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd"},"image":{"@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png?fit=1000%2C723&ssl=1","keywords":["deep learning","k-fold cross validation","LSTM","NLP","Recurrent Neural Network","RNN"],"articleSection":["Learn Machine Learning"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/","url":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/","name":"RNN (Recurrent Neural Network) in NLP and Python - Part 2 &#8902; Code A Star","isPartOf":{"@id":"https:\/\/www.codeastar.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#primaryimage"},"image":{"@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#primaryimage"},"thumbnailUrl":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png?fit=1000%2C723&ssl=1","datePublished":"2019-05-15T19:27:29+00:00","dateModified":"2019-05-15T19:27:42+00:00","description":"In this post, we continue our journey in NLP. We will discuss using Recurrent Neural Network (RNN) with Python to classify comments from text source.","breadcrumb":{"@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#primaryimage","url":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png?fit=1000%2C723&ssl=1","contentUrl":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png?fit=1000%2C723&ssl=1","width":1000,"height":723,"caption":"Recurrent Neural Network"},{"@type":"BreadcrumbList","@id":"https:\/\/www.codeastar.com\/recurrent-neural-network-rnn-in-nlp-and-python-part-2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.codeastar.com\/"},{"@type":"ListItem","position":2,"name":"RNN (Recurrent Neural Network) in NLP and Python &#8211; Part 2"}]},{"@type":"WebSite","@id":"https:\/\/www.codeastar.com\/#website","url":"https:\/\/www.codeastar.com\/","name":"Code A Star","description":"We don&#039;t wish upon a star, we code a star","publisher":{"@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.codeastar.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":["Person","Organization"],"@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/832d202eb92a3d430097e88c6d0550bd","name":"Raven Hon","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/image\/","url":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2018\/08\/logo70.png?fit=70%2C70&ssl=1","contentUrl":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2018\/08\/logo70.png?fit=70%2C70&ssl=1","width":70,"height":70,"caption":"Raven Hon"},"logo":{"@id":"https:\/\/www.codeastar.com\/#\/schema\/person\/image\/"},"description":"Raven Hon is\u00a0a 20 years+ veteran in information technology industry who has worked on various projects from console, web, game, banking and mobile applications in different sized companies.","sameAs":["https:\/\/www.codeastar.com","codeastar","https:\/\/x.com\/codeastar"]}]}},"jetpack_featured_media_url":"https:\/\/i0.wp.com\/www.codeastar.com\/wp-content\/uploads\/2019\/05\/rnn.png?fit=1000%2C723&ssl=1","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p8PcRO-vj","jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/posts\/1941","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/comments?post=1941"}],"version-history":[{"count":33,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/posts\/1941\/revisions"}],"predecessor-version":[{"id":1983,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/posts\/1941\/revisions\/1983"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/media\/1982"}],"wp:attachment":[{"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/media?parent=1941"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/categories?post=1941"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.codeastar.com\/wp-json\/wp\/v2\/tags?post=1941"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}