暂无评论
图文详情
- ISBN:9787121438103
- 装帧:平塑
- 册数:暂无
- 重量:暂无
- 开本:其他
- 页数:220
- 出版时间:2023-03-01
- 条形码:9787121438103 ; 978-7-121-43810-3
内容简介
本书介绍如何结合Python进行网络爬虫程序的开发,从Python语言的基本特性入手,详细介绍了Python网络爬虫开发的各个方面,涉及HTTP、HTML、JavaScript、正则表达式、自然语言处理、数据科学等不同领域的内容。全书共10章,包括Python基础知识、网站分析、网页解析、Python文件读写、Python与数据库、AJAX技术、模拟登录、文本与数据分析、网站测试、Scrapy爬虫框架、爬虫性能等多个主题。本书可作为高等职业院校计算机类专业的专业课教材,也可供计算机相关从业人员选用参考。
目录
目录
项目一 Python 基础认知 ····················································································.1
任务一 Python 概述 ·······································································································.1
一、Python 简介 ······································································································.1
二、安装Python ······································································································.2
三、安装PyCharm ···································································································.6
四、Python 语法规范 ·······························································································.11
任务二 Python 命令的组成 ·····························································································.13
一、基本符号 ·········································································································.14
二、常量与变量 ······································································································.16
三、数据类型 ·········································································································.19
四、功能符号 ·········································································································.24
任务三 程序结构 ·········································································································.26
一、表达式语句 ······································································································.26
二、顺序结构 ·········································································································.27
三、选择结构 ·········································································································.28
四、循环结构 ·········································································································.30
五、条件表达式 ······································································································.31
六、程序的流程控制 ································································································.32
项目实战 ·····················································································································.33
实战 输出百度网址 ································································································.33
项目二 网络爬虫基础认知 ················································································.35
任务一 网络爬虫概述 ···································································································.35
一、网络爬虫的基本原理 ··························································································.36
二、网络爬虫系统框架 ·····························································································.37
三、爬行策略 ·········································································································.37
四、网络爬虫的分类 ································································································.38
五、开源网络爬虫框架/项目 ······················································································.39
任务二 HTTP ·············································································································.41
一、HTTP 的工作原理 ·····························································································.41
二、Urllib 模块库 ···································································································.42
三、URL 定义 ·······································································································.43
四、URL 编码设置 ·································································································.47
任务三 网页请求过程 ···································································································.50
一、发送请求报文 ··································································································.51
二、返回响应 ········································································································.52
三、HTTP 消息 ··········································································
项目一 Python 基础认知 ····················································································.1
任务一 Python 概述 ·······································································································.1
一、Python 简介 ······································································································.1
二、安装Python ······································································································.2
三、安装PyCharm ···································································································.6
四、Python 语法规范 ·······························································································.11
任务二 Python 命令的组成 ·····························································································.13
一、基本符号 ·········································································································.14
二、常量与变量 ······································································································.16
三、数据类型 ·········································································································.19
四、功能符号 ·········································································································.24
任务三 程序结构 ·········································································································.26
一、表达式语句 ······································································································.26
二、顺序结构 ·········································································································.27
三、选择结构 ·········································································································.28
四、循环结构 ·········································································································.30
五、条件表达式 ······································································································.31
六、程序的流程控制 ································································································.32
项目实战 ·····················································································································.33
实战 输出百度网址 ································································································.33
项目二 网络爬虫基础认知 ················································································.35
任务一 网络爬虫概述 ···································································································.35
一、网络爬虫的基本原理 ··························································································.36
二、网络爬虫系统框架 ·····························································································.37
三、爬行策略 ·········································································································.37
四、网络爬虫的分类 ································································································.38
五、开源网络爬虫框架/项目 ······················································································.39
任务二 HTTP ·············································································································.41
一、HTTP 的工作原理 ·····························································································.41
二、Urllib 模块库 ···································································································.42
三、URL 定义 ·······································································································.43
四、URL 编码设置 ·································································································.47
任务三 网页请求过程 ···································································································.50
一、发送请求报文 ··································································································.51
二、返回响应 ········································································································.52
三、HTTP 消息 ··········································································
展开全部
作者简介
耿兴隆,Autodesk中国认证考试中心首席专家,全面负责Autodesk中国官方认证考试大纲制定、题库建设、技术咨询和师资力量培训工作。其创作的很多教材成为国内具有引导性的旗帜作品,在国内相关专业方向图书创作领域具有举足轻重的地位。
本类五星书
浏览历史
本类畅销
-
造神:人工智能神话的起源和破除 (精装)
¥32.7¥88.0 -
过程控制技术(第2版高职高专规划教材)
¥27.6¥38.0 -
专业导演教你拍好短视频
¥13.8¥39.9 -
数学之美
¥41.0¥69.0 -
系统性创新手册(管理版)
¥42.6¥119.0 -
人工智能
¥20.3¥55.0 -
硅谷之火-人与计算机的未来
¥15.5¥39.8 -
WPS OFFICE完全自学教程(第2版)
¥97.3¥139.0 -
人工智能基础及应用
¥34.6¥48.0 -
深入浅出软件架构
¥117.2¥186.0 -
软件设计的哲学(第2版)
¥54.0¥69.8 -
大数据技术导论(第2版)
¥28.9¥41.0 -
人工智能的底层逻辑
¥55.3¥79.0 -
剪映+PREMIERE+AIGC 短视频制作速成
¥73.5¥98.0 -
人人都能学AI
¥49.3¥68.0 -
剪映AI
¥52.0¥88.0 -
数据挖掘技术与应用
¥46.0¥75.0 -
数据采集与处理
¥36.4¥49.8 -
PLC结构化文本编程(第2版)
¥56.3¥79.0 -
中小型网络组建与管理
¥30.7¥43.0