机器学习、深度学习与自然语言处理领域推荐的书籍列表
数学基础
2010 - All of Statistics: A Concise Course in Statistical Inference【Book】: The goal of this book is to provide a broad background in probability and statistics for students in statistics, Computer science (especially data mining and machine learning), mathematics, and related disciplines.
2008-统计学完全教程:由美国当代著名统计学家L·沃塞曼所著的《统计学元全教程》是一本几乎包含了统计学领域全部知识的优秀教材。本书除了介绍传统数理统计学的全部内容以外,还包含了Bootstrap方法(自助法)、独立性推断、因果推断、图模型、非参数回归、正交函数光滑法、分类、统计学理论及数据挖掘等统计学领域的新方法和技术。本书不但注重概率论与数理统计基本理论的阐述,同时还强调数据分析能力的培养。本书中含有大量的实例以帮助广大读者快速掌握使用R软件进行统计数据分析。
机器学习
2007 - Pattern Recognition And Machine Learning【Book】: The book is suitable for courses on machine learning, statistics, computer science, signal processing, computer vision, data mining, and bioinformatics.
2012 - Machine Learning A Probabilistic Perspective 【Book】: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, a unified, probabilistic approach. The coverage combines breadth and depth, offering necessary background material on such topics as probability, optimization, and linear algebra as well as discussion of recent developments in the field, including conditional random fields, L1 regularization, and deep learning.
2012 - 李航:统计方法学:李航老师的这本书偏优化和推倒,推倒相应算法的时候可以参考这本书。
2014 - DataScience From Scratch【Book】: In this book, you’ll learn how many of the most fundamental data science tools and algorithms work by implementing them from scratch.
2015 - Python Data Science Handbook【Book】:Jupyter Notebooks for the Python Data Science Handbook
2015 - Data Mining, The Textbook【Book】: This textbook explores the different aspects of data mining from the fundamentals to the complex data types and their applications, capturing the wide diversity of problem domains for data mining issues.
2016 - 周志华 机器学习【Book】:周志华老师的这本书非常适合作为机器学习入门的书籍,书中的例子十分形象且简单易懂。
University of Illinois at Urbana-Champaign:Text Mining and Analytics【Course】
CS224d: Deep Learning for Natural Language Processing【Course】
Unsupervised Feature Learning and Deep Learning【Course】:来自斯坦福的无监督特征学习与深度学习系列教程
深度学习
2015-The Deep Learning Textbook【Book】:中文译本这里,The Deep Learning textbook is a resource intended to help students and practitioners enter the field of machine learning in general and deep learning in particular. The online version of the book is now complete and will remain available online for free.
Stanford Deep Learning Tutorial【Book】: This tutorial will teach you the main ideas of Unsupervised Feature Learning and Deep Learning. By working through it, you will also get to implement several feature learning/deep learning algorithms, get to see them work for yourself, and learn how to apply/adapt these ideas to new problems.
Neural Networks and Deep Learning【Book】: Neural Networks and Deep Learning is a free online book. The book will teach you about: (1) Neural networks, a beautiful biologically-inspired programming paradigm which enables a computer to learn from observational data. (2) Deep learning, a powerful set of techniques for learning in neural networks
Practical Deep Learning For Coders 【Course】:七周的免费深度学习课程,学习如何构建那些优秀的模型。
Oxford Deep NLP 2017 course【Course】: This is an advanced course on natural language processing. Automatically processing natural language inputs and producing language outputs is a key component of Artificial General Intelligence.
自然语言处理
2016 - CS224d: Deep Learning for Natural Language Processing【Course】
2015 - Text Data Management and Analysis【Book】: A Practical Introduction to Information Retrieval and Text Mining
泛数据科学
2012 - 深入浅出数据分析 中文版【Book】:《深入浅出数据分析》以类似“章回小说”的活泼形式,生动地向读者展现优秀的数据分析人员应知应会的技术:数据分析基本步骤、实验方法、最优化方法、假设检验方法、贝叶斯统计方法、主观概率法、启发法、直方图法、回归法、误差处理、相关数据库、数据整理技巧;正文之后,意犹未尽地以三篇附录介绍数据分析十大要务、R工具及ToolPak工具,在充分展现目标知识以外,为读者搭建了走向深入研究的桥梁。
Lean Analytics — by Croll & Yoskovitz: 本书是教会你如何建立基本的以商业思维去使用这些数据,虽然这本书本身定位是面向初学者,不过我觉得你可以从中学到更多。你可以从本书中学到一条基本准则、6个基础的线上商业形态以及隐藏其后的数据策略。
Business value in the ocean of data — by Fajszi, Cser & Fehér: 如果说Lean Analytics是关于面向初学者讲解商业逻辑加上数据,那么本书是面向大型公司来讲解这些内容。听上去好像没啥新鲜的,不过往往初创企业与独角兽之间面对的问题是千差万别,本书中会介绍譬如保险公司是如何进行定价预测或者银行从业者们又在面临怎样的数据问题。
Naked Statistics — Charles Wheelan: 这本书我一直很是推荐,因为它不仅仅面向数据科学家,而是为任何一个行业的人提供基本的统计思维,这一点恰恰是我认为非常关键的。这本书并没有太多的长篇大论,而是以一个又一个的故事形式来讲解统计思维在公司运营中的重要作用。
Doing Data Science — Schutt and O’Neil: 这算是最后一本非技术向的书了吧,这本书相较于上面三本更上一层楼,他深入了譬如拟合模型、垃圾信息过滤、推荐系统等等方面的知识。
Data Science at the Command Line — Janssens: 在介绍本书之前首先要强调下,千万不要畏惧编程,学习些简单的编程知识能够有助于你做更多有趣的事。你可以自己去获取、清洗、转化或者分析你的数据。不过我也不会一上来就扔出大堆的编程知识,我建议还是从简单的命令行操作开始学起,而本书正是介绍如何只用命令行就帮你完成些数据科学的任务。
Python for Data Analysis — McKinney: Python算是近几年来非常流行的数据分析的语言了吧,人生苦短,请用Python。这本书算是个大部头了,有400多页吧,不过它首先为你介绍了Python的基础语法,因此学起来不会很困难吧。
I heart logs — Jay Kreps: 最后一本书则是短小精悍,加起来才60多页吧。不过它对于数据收集和处理的技术背景有很好的概述,虽然很多分析家或者数据科学家并不会直接用到这些知识,但是至少你能够理解技术人员们可以用哪些架构去解决数据问题。
作者:张梓雄
- 点赞
- 收藏
- 关注作者
评论(0)