{"id":163,"date":"2017-07-03T18:20:15","date_gmt":"2017-07-03T18:20:15","guid":{"rendered":"http:\/\/www.codeastar.com\/?p=163"},"modified":"2017-07-08T09:52:51","modified_gmt":"2017-07-08T09:52:51","slug":"what-is-data-science","status":"publish","type":"post","link":"https:\/\/www.codeastar.com\/what-is-data-science\/","title":{"rendered":"Data Science: So you want to be a Data Scientist?"},"content":{"rendered":"
\"So
So you want to join a Data Science team?<\/figcaption><\/figure>\n

The Basic<\/h3>\n

Data Science is a trending topic among recent years. \u00a0And a Data Scientist is the #1 Best Job in America<\/a>. But before going further on this topic, let’s go back to a basic question: What is Data Science?<\/em><\/p>\n

<\/p>\n

\"Data
Data Science in Google Trends<\/figcaption><\/figure>\n

Data becomes air in modern day, it is everywhere. When you take a picture, your picture contains data of your camera model, location, date and color information. When you go online shopping, your preference and buying behavior would be served as data for the business owner. And of course, while you are reading this post, Google Analytics is recording your data as well.<\/p>\n

Data grows rapidly day after day, it provides valuable information for making business decisions. And at the same time, it just turns out being too big to handle. Likes diving into the big deep Data Ocean to find our answers.\u00a0People then deserve a more effective and scientific way to handle data, thus we have Data Science.<\/p>\n

Key components of Data Science<\/h3>\n

Problem in a specified domain<\/strong><\/h4>\n

(oh wait, are you expecting me saying “Data” is the first component of Data Science? :]] )<\/em>
\nWe use Data Science to look for a solution. But there is always no solution without a problem. That is why we need a problem to solve before we start our “science”. \u00a0It could be “How to boost the sale on certain items”, “How to increase the success rate for certain rescue operations” or others.<\/p>\n

Data set<\/strong><\/h4>\n

Back to our key word in Data Science, data. A data set is a collection of data entities with attributes and behaviors in certain events. For example, we would like to boost our sale volume, so we look for our data set: the sale transaction record. The data set contains:<\/p>\n