R语言机器学习-第2版-(影印版)

1星价 ¥58.3 (6.7折)

2星价￥58.3 定价￥87.0

作者：(美)布雷特·兰茨(BrettLant

出版社：东南大学出版社

本类榜单：计算机/网络

分类：计算机/网络 > 程序设计 > 其他

买过本商品的人还买了

暂无评论

图文详情

ISBN：9787564170714
装帧：暂无
册数：暂无
重量：暂无
开本：32开
页数：426
出版时间：2017-04-01
条形码：9787564170714 ; 978-7-5641-7071-4

本书特色

布雷特·兰茨编*的《R语言机器学习（第2版影印版）（英文版）》与时俱进。携*新的库和*现代的编程思维为你丝丝入扣地介绍了专业数据科学必不可少的技能。不用再惧怕理论知识。书中提供了编写算法和处理数据所需的*关键的实用知识，只要有* 基本的经验就可以了。
你可以在书中找到洞悉复杂的数据所需的全部分析工具，还能学到如何选择正确的算法来解决特定的问题。通过与各种真实问题的亲密接触，你将学会如何应用机器学习方法来处理常见的任务，包括分类、预测、市场分析以及聚类。
目标读者可能你对机器学习多少有一点了解，但是从没用过R语言，或者是知道些R语言，但是没接触过机器学习。不管是哪一种情况，本书都能够帮助你快速上手。如果熟悉一些编程概念自然是*好的。不过并不要求之前有编程经验。
你将从本书中学到什么驾驭R语言的威力，使用真实的数据科学应用构建常见的机器学习算法。
学习利用R语言技术对待分析数据进行清理和预处理并可视化处理结果。
了解不同类型的机器学习模型，选择符合数据处理需求的*佳模型，解决数据分析难题。
使用贝叶斯算法和*近邻算法分类数据。
使用R语言预测数值来构建决策树、规则以及支持向量机。
使用线性回归预测数值，使用神经网络建模数据。
对机器学习模型性能进行评估和改进。
学习专用于文本挖掘、社交网络数据、大数据等的机器学习技术。

内容简介

本书与时俱进，携*新的库和*现代的编程思维为你丝丝入扣的介绍了专业数据科学必不可少的技能。不用再惧怕理论知识，书中提供了编写算法和处理数据所需的*关键的实用知识，只要有*基本的经验就可以了。你可以在书中找到洞悉复杂的数据所需的全部分析工具，还能学到如何选择正确的算法来解决特定的问题。通过与各种真实问题的亲密接触，你将学会如何应用机器学习方法来处理常见的任务，包括分类、预测、市场分析以及聚类。

Preface Chapter 1: Introducing Machine LearningThe origins of machine learningUses and abuses of machine learningMachine learning successesThe limits of machine learningMachine learning ethicsHow machines learnData storageAbstractionGeneralizationEvaluationMachine learning in practiceTypes of input dataTypes of machine learning algorithmsMatching input data to algorithmsMachine learning with RInstalling R packagesLoading and unloading R packagesSummary Chapter 2: Managing and Understanding Data R data structuresVectorsFactorsListsData framesMatrixes and arraysManaging data with RSaving, loading, and removing R data structuresImporting and saving data from CSV filesExploring and understanding dataExploring the structure of dataExploring numeric variablesMeasuring the central tendency- mean and medianMeasuring spread - quartiles and the five-number summaryVisualizing numeric variables - boxplotsVisualizing numeric variables - histogramsUnderstanding numeric data - uniform and normal distributionsMeasuring spread - variance and standard deviationExploring categorical variablesMeasuring the central tendency - the modeExploring relationships between variablesVisualizing relationships - scatterplotsExamining relationships - two-way cross-tabulationsSummary Chapter 3: Lazy Learning - Classification Using Nearest NeighborsUnderstanding nearest neighbor classificationThe k-NN algorithmMeasuring similarity with distanceChoosing an appropriate kPreparing data for use with k-NNWhy is the k-NN algorithm lazy?Example - diagnosing breast cancer with the k-NN algorithmStep 1 - collecting dataStep 2 - exploring and preparing the dataTransformation - normalizing numeric dataData preparation - creating training and test datasetsStep 3 - training a model on the dataStep 4 - evaluating model performanceStep 5 -improving model performanceTransformation - z-score standardizationTesting alternative values of kSummary Chapter 4: Probabilistic Learning - Classification Using Naive BayesUnderstanding Naive BayesBasic concepts of Bayesian methodsUnderstanding probabilityUnderstanding joint probabilityComputing conditional probability with Bayes' theoremThe Naive Bayes algorithmClassification with Naive BayesThe Laplace estimatorUsing numeric features with Naive BayesExample - filtering mobile phone spam with theNaive Bayes algorithmStep 1 - collecting dataStep 2 - exploring and preparing the dataData preparation - cleaning and standardizing text dataData preparation - splitting text documents into wordsData preparation - creating training and test datasetsVisualizing text data - word cloudsData preparation - creating indicator features for frequent wordsStep 3 - training a model on the dataStep 4 - evaluating model performanceStep 5 -improving model performanceSummary Chapter 5: Divide and Conquer - Classification Using Decision Trees and Rules Chapter 6: Forecasting Numeric Data - Regression Methods Chapter 7: Black Box Methods - Neural Networks and Support Vector Machines Chapter 8: Finding Patterns - Market Basket Analysis Using Association Rules Chapter 9: Finding Groups of Data - Clustering with k-means Chapter 10: Evaluating Model Performance Chapter 11: Improving Model Performance Chapter 12: Specialized Machine Learning Topics Index

展开全部

作者简介

布雷特·兰茨（Brett Lantz）在应用创新的数据方法来理解人类的行为方面有10余年经验。他*初是一名社会学家，在学习一个青少年社交网站分布的大型数据库时，他就开始陶醉于机器学习。从那时起，他致力于移动电话、医疗账单数据和公益活动等交叉学科的研究，并维护dataspelunking.com这个网站，该网站致力于分享有关探寻数据中所蕴含的洞察的知识。

本类五星书