在好例子网,分享、交流、成长!
您当前所在位置:首页Others 开发实例一般编程问题 → Machine.Learning.for.Hackers

Machine.Learning.for.Hackers

一般编程问题

下载此实例

实例介绍

【实例简介】Machine.Learning.for.Hackers 

【实例截图】

【核心代码】

Table of Contents
Preface ..................................................................... vii
1. Using R ................................................................ 1
R for Machine Learning 2
Downloading and Installing R 5
IDEs and Text Editors 8
Loading and Installing R Packages 9
R Basics for Machine Learning 12
Further Reading on R 27
2. Data Exploration ....................................................... 29
Exploration versus Confirmation 29
What Is Data? 30
Inferring the Types of Columns in Your Data 34
Inferring Meaning 36
Numeric Summaries 37
Means, Medians, and Modes 37
Quantiles 40
Standard Deviations and Variances 41
Exploratory Data Visualization 44
Visualizing the Relationships Between Columns 61
3. Classification: Spam Filtering ............................................ 73
This or That: Binary Classification 73
Moving Gently into Conditional Probability 77
Writing Our First Bayesian Spam Classifier 78
Defining the Classifier and Testing It with Hard Ham 85
Testing the Classifier Against All Email Types 88
Improving the Results 90
iii
4. Ranking: Priority Inbox ................................................. 93
How Do You Sort Something When You Don’t Know the Order? 93
Ordering Email Messages by Priority 95
Priority Features of Email 95
Writing a Priority Inbox 99
Functions for Extracting the Feature Set 100
Creating a Weighting Scheme for Ranking 108
Weighting from Email Thread Activity 113
Training and Testing the Ranker 117
5. Regression: Predicting Page Views ....................................... 127
Introducing Regression 127
The Baseline Model 127
Regression Using Dummy Variables 132
Linear Regression in a Nutshell 133
Predicting Web Traffic 141
Defining Correlation 152
6. Regularization: Text Regression ......................................... 155
Nonlinear Relationships Between Columns: Beyond Straight Lines 155
Introducing Polynomial Regression 158
Methods for Preventing Overfitting 165
Preventing Overfitting with Regularization 169
Text Regression 174
Logistic Regression to the Rescue 178
7. Optimization: Breaking Codes ........................................... 183
Introduction to Optimization 183
Ridge Regression 190
Code Breaking as Optimization 193
8. PCA: Building a Market Index ........................................... 205
Unsupervised Learning 205
9. MDS: Visually Exploring US Senator Similarity ............................. 215
Clustering Based on Similarity 215
A Brief Introduction to Distance Metrics and Multidirectional Scaling 216
How Do US Senators Cluster? 222
Analyzing US Senator Roll Call Data (101st–111th Congresses) 223
10. kNN: Recommendation Systems ......................................... 233
The k-Nearest Neighbors Algorithm 233
iv | Table of Contents
R Package Installation Data 239
11. Analyzing Social Graphs ................................................ 243
Social Network Analysis 243
Thinking Graphically 246
Hacking Twitter Social Graph Data 248
Working with the Google SocialGraph API 250
Analyzing Twitter Networks 256
Local Community Structure 257
Visualizing the Clustered Twitter Network with Gephi 261
Building Your Own “Who to Follow” Engine 267
12. Model Comparison .................................................... 275
SVMs: The Support Vector Machine 275
Comparing Algorithms 284
Works Cited ................................................................ 293
Index ..................................................................... 295

标签: learning MAChine MAC for ck

网友评论

发表评论

(您的评论需要经过审核才能显示)

查看所有0条评论>>

小贴士

感谢您为本站写下的评论,您的评论对其它用户来说具有重要的参考价值,所以请认真填写。

  • 类似“顶”、“沙发”之类没有营养的文字,对勤劳贡献的楼主来说是令人沮丧的反馈信息。
  • 相信您也不想看到一排文字/表情墙,所以请不要反馈意义不大的重复字符,也请尽量不要纯表情的回复。
  • 提问之前请再仔细看一遍楼主的说明,或许是您遗漏了。
  • 请勿到处挖坑绊人、招贴广告。既占空间让人厌烦,又没人会搭理,于人于己都无利。

关于好例子网

本站旨在为广大IT学习爱好者提供一个非营利性互相学习交流分享平台。本站所有资源都可以被免费获取学习研究。本站资源来自网友分享,对搜索内容的合法性不具有预见性、识别性、控制性,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,平台无法对用户传输的作品、信息、内容的权属或合法性、安全性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论平台是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二与二十三条之规定,若资源存在侵权或相关问题请联系本站客服人员,点此联系我们。关于更多版权及免责申明参见 版权及免责申明

;
报警