- ISBN: 9787564189518
- Binding: standard offset paper
- Volumes: not available
- Weight: not available
- Format: 16mo
- Pages: 238
- Publication date: 2020-08-01
- Barcode: 9787564189518 ; 978-7-5641-8951-8
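Book Highlights
Author Emmanuel Ameisen is an experienced data scientist who led an AI education program. He demonstrates practical machine learning concepts through code snippets, illustrations, screenshots, and interviews with industry leaders. Part I teaches how to design a machine learning application and evaluate its results; Part II explains how to build a working machine learning model; Part III demonstrates ways to improve the model until it fulfills your original vision; Part IV covers deployment and monitoring strategies. This book will help you:
- Define your product goal and set up a machine learning problem
- Quickly build an end-to-end machine learning pipeline and acquire an initial dataset
- Train and evaluate machine learning models and address performance bottlenecks
- Deploy and monitor models in a production environment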
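Description
Learn the skills needed to design, build, and deploy applications powered by machine learning (ML). Through this hands-on tutorial, you will build an example ML-driven application, turning an initial idea into a deployable product. Data scientists, software engineers, and product managers, whether seasoned experts or newcomers, can learn step by step the tools, best practices, and technical challenges involved in building real-world ML applications. Author Emmanuel Ameisen is an experienced data scientist who led an AI education program. He demonstrates practical machine learning concepts through code snippets, illustrations, screenshots, and interviews with industry leaders. Part I teaches how to design a machine learning application and evaluate its results; Part II explains how to build a working machine learning model; Part III demonstrates ways to improve the model until it fulfills your original vision; Part IV covers deployment and monitoring strategies. This book will help you:
- Define your product goal and set up a machine learning problem
- Quickly build an end-to-end machine learning pipeline and acquire an initial dataset
- Train and evaluate machine learning models and address performance bottlenecks
- Deploy and monitor models in a production environment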
Table of Contents
Preface
Part I. Find the Correct ML Approach
1. From Product Goal to ML Framing
Estimate What Is Possible
Models
Data
Framing the ML Editor
Trying to Do It All with ML: An End-to-End Framework
The Simplest Approach: Being the Algorithm
Middle Ground: Learning from Our Experience
Monica Rogati: How to Choose and Prioritize ML Projects
Conclusion
2. Create a Plan
Measuring Success
Business Performance
Model Performance
Freshness and Distribution Shift
Speed
Estimate Scope and Challenges
Leverage Domain Expertise
Stand on the Shoulders of Giants
ML Editor Planning
Initial Plan for an Editor
Always Start with a Simple Model
To Make Regular Progress: Start Simple
Start with a Simple Pipeline
Pipeline for the ML Editor
Conclusion
Part II. Build a Working Pipeline
3. Build Your First End-to-End Pipeline
The Simplest Scaffolding
Prototype of an ML Editor
Parse and Clean Data
Tokenizing Text
Generating Features
Test Your Workflow
User Experience
Modeling Results
ML Editor Prototype Evaluation
Model
User Experience
Conclusion
4. Acquire an Initial Dataset
Iterate on Datasets
Do Data Science
Explore Your First Dataset
Be Efficient, Start Small
Insights Versus Products
A Data Quality Rubric
Label to Find Data Trends
Summary Statistics
Explore and Label Efficiently
Be the Algorithm
Data Trends
Let Data Inform Features and Models
Build Features Out of Patterns
ML Editor Features
Robert Munro: How Do You Find, Label, and Leverage Data?
Conclusion
Part III. Iterate on Models
5. Train and Evaluate Your Model
The Simplest Appropriate Model
Simple Models
From Patterns to Models
Split Your Dataset
ML Editor Data Split
Judge Performance
Evaluate Your Model: Look Beyond Accuracy
Contrast Data and Predictions
Confusion Matrix
ROC Curve
Calibration Curve
Dimensionality Reduction for Errors
The Top-k Method
Other Models
Evaluate Feature Importance
Directly from a Classifier
Black-Box Explainers
Conclusion
6. Debug Your ML Problems
Software Best Practices
ML-Specific Best Practices
Debug Wiring: Visualizing and Testing
Start with One Example
Test Your ML Code
Debug Training: Make Your Model Learn
Task Difficulty
Optimization Problems
Debug Generalization: Make Your Model Useful
Data Leakage
Overfitting
Consider the Task at Hand
Conclusion
7. Using Classifiers for Writing Recommendations
Extracting Recommendations from Models
What Can We Achieve Without a Model?
Extracting Global Feature Importance
Using a Model's Score
Extracting Local Feature Importance
Comparing Models
Version 1: The Report Card
Version 2: More Powerful, More Unclear
Version 3: Understandable Recommendations
Generating Editing Recommendations
Conclusion
Part IV. Deploy and Monitor
8. Considerations When Deploying Models
Data Concerns
Data Ownership
Data Bias
Systemic Bias
Modeling Concerns
Feedback Loops
Inclusive Model Performance
Considering Context
Adversaries
Abuse Concerns and Dual-Use
Chris Harland: Shipping Experiments
Conclusion
9. Choose Your Deployment Option
Server-Side Deployment
Streaming Application or API
Batch Predictions
Client-Side Deployment
On Device
Browser Side
Federated Learning: A Hybrid Approach
Conclusion
10. Build Safeguards for Models
Engineer Around Failures
Input and Output Checks
Model Failure Fal
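Excerpt
"Many books on machine learning skip the hardest parts: framing the problem, debugging models, and deploying to customers. This book focuses on exactly those topics, so you can take your project from an idea to an impactful product." -- Alexander Gude (data scientist at Intuit)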
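About the Author
Emmanuel Ameisen is a machine learning engineer at Stripe. He previously implemented and deployed predictive analytics and machine learning solutions for Local Motion and Zipcar. More recently, he led the AI program at Insight Data Science, where he oversaw more than 100 machine learning projects. He holds master's degrees in artificial intelligence, computer engineering, and management from three French universities.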