
Data Science: Foundations using R (Specialization)

This Specialization covers foundational data science tools and techniques, including getting, cleaning, and exploring data, programming in R, and conducting reproducible research.

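Johns Hopkins University

Coursera

Computer Science

Easy (Beginner)

4 months

This course is provided jointly by Coursera and Linkshare
  • Chinese, English, French, Japanese, Korean, Spanish, Italian, German, Portuguese, Russian, other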
  • 593

Course Overview

Ask the right questions, manipulate data sets, and create visualizations to communicate results.
This Specialization covers foundational data science tools and techniques, including getting, cleaning, and exploring data, programming in R, and conducting reproducible research. Learners who complete this specialization will be prepared to take the Data Science: Statistics and Machine Learning specialization, in which they build a data product using real-world data.

The five courses in this specialization are the same courses that make up the first half of the Data Science Specialization. This specialization is for learners who want to complete the foundational part of the curriculum first, before moving on to the more advanced topics in Data Science: Statistics and Machine Learning.

What You Will Learn

Use R to clean, analyze, and visualize data.

Learn how to ask the right questions, obtain data, and perform reproducible research.

Use GitHub to manage data science projects.

Courses Included

Course 1
The Data Scientist's Toolbox (Chinese version)

In this course you will get an introduction to the main tools and ideas in the data scientist's toolbox. The course gives an overview of the data, questions, and tools that data analysts and data scientists work with. There are two components to this course. The first is a conceptual introduction to the ideas behind turning data into actionable knowledge. The second is a practical introduction to the tools that will be used in the program, such as version control, Markdown, Git, GitHub, R, and RStudio.

Course 2
R Programming (Chinese version)

In this course you will learn how to program in R and how to use R for effective data analysis. You will learn how to install and configure the software necessary for a statistical programming environment and describe generic programming language concepts as they are implemented in a high-level statistical language. The course covers practical issues in statistical computing, including programming in R, reading data into R, accessing R packages, writing R functions, debugging, profiling R code, and organizing and commenting R code. Topics in statistical data analysis will provide working examples.

Course 3
Getting and Cleaning Data

Before you can work with data you have to get some. This course will cover the basic ways that data can be obtained. The course will cover obtaining data from the web, from APIs, from databases and from colleagues in various formats. It will also cover the basics of data cleaning and how to make data “tidy”. Tidy data dramatically speed downstream data analysis tasks. The course will also cover the components of a complete data set including raw data, processing instructions, codebooks, and processed data. The course will cover the basics needed for collecting, cleaning, and sharing data.

Course 4
Exploratory Data Analysis

This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data. We will cover in detail the plotting systems in R as well as some of the basic principles of constructing data graphics. We will also cover some of the common multivariate statistical techniques used to visualize high-dimensional data.

Course 5
Reproducible Research

This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them. The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations. Reproducibility allows for people to focus on the actual content of a data analysis, rather than on superficial details reported in a written summary. In addition, reproducibility makes an analysis more useful to others because the data and code that actually conducted the analysis are available. This course will focus on literate statistical analysis tools which allow one to publish data analyses in a single document that allows others to easily execute the same analysis to obtain the same results.

Prerequisites

Some programming experience (in any language) is recommended. We also suggest a working knowledge of mathematics up to algebra (neither calculus nor linear algebra is required).


© 2008-2019 MOOC.CN. MOOCs change you; you change the world.
