Java Data Analysis (Reprint Edition)
- ISBN: 9787564177362
- Binding: standard offset paper
- Number of volumes: not available
- Weight: not available
- Format: 16mo (16开)
- Pages: 390
- Publication date: 2018-08-01
- Barcode: 9787564177362; 978-7-5641-7736-2
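Description
Data analysis is the process of inspecting, cleaning, transforming, and modeling data with the goal of discovering useful information. Java is one of the popular languages for carrying out data analysis tasks. Java Data Analysis (Reprint Edition, English) gives a quick overview of data science and the steps of its processes. You will learn statistical data analysis techniques and implement them with popular Java APIs and libraries. You will also learn machine learning concepts such as classification and regression through practical examples. Along the way, you will become familiar with tools such as RapidMiner and Weka and see how these Java tools can be used more effectively for analysis. You will also learn how to work with relational, NoSQL, and time-series data. The book also shows how to create insightful, easy-to-understand charts with various Java libraries. By the end of the book, you will have a solid grounding in a variety of data analysis techniques and their Java implementations.
Table of Contents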
Chapter 1: Introduction to Data Analysis
Origins of data analysis
The scientific method
Actuarial science
Calculated by steam
A spectacular example
Herman Hollerith
ENIAC
VisiCalc
Data, information, and knowledge
Why Java?
Java Integrated Development Environments
Summary
Chapter 2: Data Preprocessing
Data types
Variables
Data points and datasets
Null values
Relational database tables
Key fields
Key-value pairs
Hash tables
File formats
Microsoft Excel data
XML and JSON data
Generating test datasets
Metadata
Data cleaning
Data scaling
Data filtering
Sorting
Merging
Hashing
Summary
Chapter 3: Data Visualization
Tables and graphs
Scatter plots
Line graphs
Bar charts
Histograms
Time series
Java implementation
Moving average
Data ranking
Frequency distributions
The normal distribution
A thought experiment
The exponential distribution
Java example
Summary
Chapter 4: Statistics
Descriptive statistics
Random sampling
Random variables
Probability distributions
Cumulative distributions
The binomial distribution
Multivariate distributions
Conditional probability
The independence of probabilistic events
Contingency tables
Bayes' theorem
Covariance and correlation
The standard normal distribution
The central limit theorem
Confidence intervals
Hypothesis testing
Summary
Chapter 5: Relational Databases
The relation data model
Relational databases
Foreign keys
Relational database design
Creating a database
SQL commands
Inserting data into the database
Database queries
SQL data types
JDBC
Using a JDBC PreparedStatement
Batch processing
Database views
Subqueries
Table indexes
Summary
Chapter 6: Regression Analysis
Linear regression
Linear regression in Excel
Computing the regression coefficients
Variation statistics
Java implementation of linear regression
Anscombe's quartet
Polynomial regression
Multiple linear regression
The Apache Commons implementation
Curve fitting
Summary
Chapter 7: Classification Analysis
Decision trees
What does entropy have to do with it?
The ID3 algorithm
Java implementation of the ID3 algorithm
The Weka platform
The ARFF filetype for data
Java implementation with Weka
Bayesian classifiers
Java implementation with Weka
Support vector machine algorithms
Logistic regression
K-Nearest Neighbors
Fuzzy classification algorithms
Summary
Chapter 8: Cluster Analysis
Measuring distances
The curse of dimensionality
Hierarchical clustering
Weka implementation
K-means clustering
K-medoids clustering
Affinity propagation clustering
Summary
Chapter 9: Recommender Systems
Utility matrices
Similarity measures
Cosine similarity
A simple recommender system
Amazon's item-to-item collaborative filtering recommender
Implementing user ratings
Large sparse matrices
Using random access files
The Netflix prize
Summary
Chapter 10: NoSQL Databases
The Map data structure
SQL versus NoSQL
The Mongo database system
The Library database
Java development with MongoDB
The MongoDB extension for geospatial databases
Indexing in MongoDB
Why NoSQL and why MongoDB?
Other NoSQL database systems
Summary
Chapter 11: Big Data Analysis with Java
Scaling, data striping, and sharding
Google's PageRank algorithm
Google's MapReduce framework
Some examples of MapReduce applications
The WordCount example
Scalability
Matrix multiplication with MapReduce
MapReduce in MongoDB
Apache Hadoop
Hadoop MapReduce
Summary
Appendix: Java Tools
The command line
Java
NetBeans
MySQL
MySQL Workbench
Accessing the MySQL database from NetBeans
The Apache Commons Math Library
The javax JSON Library
The Weka libraries
MongoDB
Index