1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950 |
- ---
- interact_link: content/chapters/01/1/2/statistical-techniques.md
- kernel_name:
- has_widgets: false
- title: |-
- Statistical Techniques
- prev_page:
- url: /chapters/01/1/1/computational-tools.html
- title: |-
- Computational Tools
- next_page:
- url: /chapters/01/2/why-data-science.html
- title: |-
- Why Data Science?
- comment: "***PROGRAMMATICALLY GENERATED, DO NOT EDIT. SEE ORIGINAL FILES IN /content***"
- ---
- <div class="jb_cell">
- <div class="cell border-box-sizing text_cell rendered"><div class="inner_cell">
- <div class="text_cell_render border-box-sizing rendered_html">
- <h2 id="Statistical-Techniques">Statistical Techniques<a class="anchor-link" href="#Statistical-Techniques"> </a></h2><p>The discipline of statistics has long addressed the same fundamental challenge
- as data science: how to draw robust conclusions about the world using incomplete
- information. One of the most important contributions of statistics is a
- consistent and precise vocabulary for describing the relationship between
- observations and conclusions. This text continues in the same tradition,
- focusing on a set of core inferential problems from statistics: testing
- hypotheses, estimating confidence, and predicting unknown quantities.</p>
- <p>Data science extends the field of statistics by taking full advantage of
- computing, data visualization, machine learning, optimization, and access
- to information. The combination of fast computers and the Internet gives
- anyone the ability to access and analyze
- vast datasets: millions of news articles, full encyclopedias, databases for
- any domain, and massive repositories of music, photos, and video.</p>
- <p>Applications to real data sets motivate the statistical techniques that we
- describe throughout the text. Real data often do not follow regular patterns or
- match standard equations. The interesting variation in real data can be lost by
- focusing too much attention on simplistic summaries such as average values.
- Computers enable a family of methods based on resampling that apply to a wide
- range of different inference problems, take into account all available
- information, and require few assumptions or conditions. Although these
- techniques have often been reserved for advanced courses in statistics, their
- flexibility and simplicity are a natural fit for data science applications.</p>
- </div>
- </div>
- </div>
- </div>
-
|