123456789101112131415161718192021222324252627282930313233343536373839404142434445 |
- ---
- interact_link: content/chapters/01/what-is-data-science.md
- kernel_name:
- has_widgets: false
- title: |-
- Data Science
- prev_page:
- url: /chapters/intro.html
- title: |-
- Introduction
- next_page:
- url: /chapters/01/1/intro.html
- title: |-
- Introduction
- comment: "***PROGRAMMATICALLY GENERATED, DO NOT EDIT. SEE ORIGINAL FILES IN /content***"
- ---
- <div class="jb_cell">
- <div class="cell border-box-sizing text_cell rendered"><div class="inner_cell">
- <div class="text_cell_render border-box-sizing rendered_html">
- <h1 id="What-is-Data-Science?">What is Data Science?<a class="anchor-link" href="#What-is-Data-Science?"> </a></h1><p>Data Science is about drawing useful conclusions from large and diverse data
- sets through exploration, prediction, and inference. Exploration involves
- identifying patterns in information. Prediction involves using information
- we know to make informed guesses about values we wish we knew. Inference
- involves quantifying our degree of certainty: will the patterns that we found in our data also appear in new observations? How accurate are our predictions? Our primary
- tools for exploration are visualizations and descriptive statistics, for
- prediction are machine learning and optimization, and for inference are
- statistical tests and models.</p>
- <p>Statistics is a central component of data science because statistics
- studies how to make robust conclusions based on incomplete information. Computing
- is a central component because programming allows us to apply analysis
- techniques to the large and diverse data sets that arise in real-world
- applications: not just numbers, but text, images, videos, and sensor readings.
- Data science is all of these things, but it is more than the sum of its parts
- because of the applications. Through understanding a particular domain, data
- scientists learn to ask appropriate questions about their data and correctly
- interpret the answers provided by our inferential and computational tools.</p>
- </div>
- </div>
- </div>
- </div>
-
|