新智元整理1
整理:李宏菲
新年伊始,相信每個人已經(jīng)制定好了自己2016年的計劃。隨著無人機和智能機器人在春晚亮相,想必許多人會對“人工智能”、“機器學(xué)習(xí)”,“深度學(xué)習(xí)”這些科技熱詞充滿了好奇。為此,新智元給眾多熱愛人工智能領(lǐng)域的讀者準(zhǔn)備了一份豐厚的大理。小編深知許多對人工智能領(lǐng)域感興趣的讀者可能還不知如何入手該領(lǐng)域,那么,小編建議就從了解深度學(xué)習(xí)開始吧!新智元為學(xué)習(xí)深度學(xué)習(xí)的初學(xué)者整理了一份非常全面的書單,下面就隨小編一起來看看這份書單中包含哪些板塊的內(nèi)容呢?
一、關(guān)于矩陣或者單變量微積分計算的文獻(共5項)
Introduction to Algorithms by Erik Demaine and Srinivas Devadas.
Single Variable Calculus by David Jerison.
Multivariable Calculus by Denis Auroux.
Differential Equations by Arthur Mattuck, Haynes Miller, Jeremy Orloff, John Lewis.
Linear Algebra by Gilbert Strang.
二、基于深度學(xué)習(xí)的計算理論,學(xué)習(xí)理論,神經(jīng)科學(xué)等等(共12項)
Introduction to the Theory of Computation by Michael Sipser.
Artificial Intelligence: A Modern Approach by Stuart Russell and Peter Norvig.
Pattern Recognition and Machine Learning by Christopher Bishop.
Machine Learning: A probabilistic perspective by Kevin Patrick Murphy.
CS229 Machine Learning Course Materials by Andrew Ng at Stanford University.
Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto.
Probabilistic Graphical Models: Principles and Techniques by Daphne Koller and Nir Friedman.
Convex Optimization by Stephen Boyd and Lieven Vandenberghe.
An Introduction to Statistical Learning with application in R by Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani.
Neuronal Dynamics: From single neurons to networks and models of cognition by Wulfram Gerstner, Werner M. Kistler, Richard Naud and Liam Paninski.
Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems by Peter Dayan and Laurence F. Abbott.
Michael I. Jordan Reading List of Machine Learning at Hacker News.
三、關(guān)于深度學(xué)習(xí)基礎(chǔ)知識的文獻(共5項)
Deep Learning in Neural Networks: An Overview by Jürgen Schmidhuber.
Deep Learning Book by Yoshua Bengio, Ian Goodfellow and Aaron Courville.
Learning Deep Architectures for AI by Yoshua Bengio.
Representation Learning: A Review and New Perspectives by Yoshua Bengio, Aaron Courville, Pascal Vincent.
Reading lists for new LISA students by LISA Lab, University of Montreal.
四、關(guān)于深度學(xué)習(xí)的教材,實用手冊以及有用的軟件(共17項)
Machine Learning by Andrew Ng.
Neural Networks for Machine Learning by Geoffrey Hinton.
Deep Learning Tutorial by LISA Lab, University of Montreal.
Unsupervised Feature Learning and Deep Learning Tutorial by AI Lab, Stan ford University.
CS231n: Convolutional Neural Networks for Visual Recognition by Stanfor d Uiversity.
CS224d: Deep Learning for Natural Language Processing by Stanford Univer sity.
Theano by LISA Lab, University of Montreal.
PyLearn2 by LISA Lab, University of Montreal.
Caffe by Berkeley Vision and Learning Center (BVLC) and community contrib utor Yangqing Jia.
Torch 7
neon by Nervana.
cuDNN by NVIDIA.
ConvNetJS by Andrej Karpathy.
DeepLearning4j
Chainer: Neural network framework by Preferred Networks, Inc.
Blocks by LISA Lab, University of Montreal.
Fuel by LISA Lab, University of Montreal.
五、關(guān)于深度學(xué)習(xí)和特征學(xué)習(xí)的文獻(共11項)
Automatic Speech Recognition - A Deep Learning Approach by Dong Yu an d Li Deng (Published by Springer, no Open Access)
Backpropagation Applied to Handwritten Zip Code Recognition by Y. LeCu n, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard an d L. D. Jackel.
Comparison of Training Methods for Deep Neural Networks by Patrick O. Glauner.
Deep Learning by Yann LeCun, Yoshua Bengio, Geoffrey Hinton. (NO FREE COPY AVAILABLE)
Distributed Representations of Words and Phrases and their Compositionality by Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado and Jeffrey Dean.
Efficient Estimation of Word Representations in Vector Space by Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean.
Efficient Large Scale Video Classification by Balakrishnan Varadarajan, George Toderici, Sudheendra Vijayanarasimhan, Apostol Natsev.
Foundations and Trends in Signal Processing: DEEP LEARNING — Methods and Applications by Li Deng and Dong Yu.
From Frequency to Meaning: Vector Space Models of Semantics by Peter D. Turney and Patrick Pantel.
LSTM: A Search Space Odyssey by Klaus Greff, Rupesh Kumar Srivastava, Jan Koutník, Bas R. Steunebrink, Jürgen Schmidhuber.
Supervised Sequence Labelling with Recurrent Neural Networks by Alex Graves.
六、最近必讀的關(guān)于深度學(xué)習(xí)領(lǐng)域的最新進展(共332項)
A Convolutional Attention Network for Extreme Summarization of Source Code by Miltiadis Allamanis, Hao Peng, Charles Sutton.
A Deep Bag-of-Features Model for Music Auto-Tagging by Juhan Nam, Jorge Herrera, Kyogu Lee.
A Deep Generative Deconvolutional Image Model by Yunchen Pu, Xin Yuan, Andrew Stevens, Chunyuan Li, Lawrence Carin.
A Deep Neural Network Compression Pipeline: Pruning, Quantization, Huffman Encoding by Song Han, Huizi Mao, William J. Dally.
A Deep Pyramid Deformable Part Model for Face Detection by Rajeev Ranjan, Vishal M. Patel, Rama Chellappa.
A Deep Siamese Network for Scene Detection in Broadcast Videos by Lorenzo Baraldi, Costantino Grana, Rita Cucchiara.
A Hierarchical Recurrent Encoder-Decoder For Generative Context-Aware Query Suggestion by Alessandro Sordoni, Yoshua Bengio, Hossein Vahabi, Christina Lioma, Jakob G. Simonsen, Jian-Yun Nie.
A Large-Scale Car Dataset for Fine-Grained Categorization and Verification by Linjie Yang, Ping Luo, Chen Change Loy, Xiaoou Tang.
A Lightened CNN for Deep Face Representation by Xiang Wu, Ran He, Zhenan Sun.
A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction by Thomas Wiatowski, Helmut B?lcskei.
A Multi-scale Multiple Instance Video Description Network by Huijuan Xu, Subhashini Venugopalan, Vasili Ramanishka, Marcus Rohrbach, Kate Saenko.
A Recurrent Latent Variable Model for Sequential Data by Junyoung Chung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron Courville, Yoshua Bengio.
A Restricted Visual Turing Test for Deep Scene and Event Understanding by Hang Qi, Tianfu Wu, Mun-Wai Lee, Song-Chun Zhu.
A Sensitivity Analysis of (and Practitioners’ Guide to) Convolutional Neural Networks for Sentence Classification by Ye Zhang, Byron Wallace.
ABC-CNN: An Attention Based Convolutional Neural Network for Visual Question Answering by Kan Chen, Jiang Wang, Liang-Chieh Chen, Haoyuan Gao, Wei Xu, Ram Nevatia.
Accelerating Very Deep Convolutional Networks for Classification and Detection by Xiangyu Zhang, Jianhua Zou, Kaiming He, Jian Sun.
Accurate Image Super-Resolution Using Very Deep Convolutional Networks by Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee.
Action Recognition using Visual Attention by Shikhar Sharma, Ryan Kiros, Ruslan Salakhutdinov.
Action Recognition With Trajectory-Pooled Deep-Convolutional Descriptors by Limin Wang, Yu Qiao, Xiaoou Tang.
Action-Conditional Video Prediction using Deep Networks in Atari Games by Junhyuk Oh, Xiaoxiao Guo, Honglak Lee, Richard Lewis, Satinder Singh.
Active Object Localization with Deep Reinforcement Learning by Juan C. Caicedo, Svetlana Lazebnik.
adaQN: An Adaptive Quasi-Newton Algorithm for Training RNNs by Nitish Shirish Keskar, Albert S. Berahas.
Adding Gradient Noise Improves Learning for Very Deep Networks by Arvind Neelakantan, Luke Vilnis, Quoc V. Le, Ilya Sutskever, Lukasz Kaiser, Karol Kurach, James Martens.
Adversarial Autoencoders by Alireza Makhzani, Jonathon Shlens, Navdeep Jaitly, Ian Goodfellow.
Adversarial Manipulation of Deep Representations by Sara Sabour, Yanshuai Cao, Fartash Faghri, David J. Fleet.
All you need is a good init by Dmytro Mishkin, Jiri Matas.
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition by Baoguang Shi, Xiang Bai, Cong Yao.
Answer Sequence Learning with Neural Networks for Answer Selection in Community Question Answering by Xiaoqiang Zhou, Baotian Hu, Qingcai Chen, Buzhou Tang, Xiaolong Wang.
Anticipating the future by watching unlabeled video by Carl Vondrick, Hamed Pirsiavash, Antonio Torralba.
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering by Haoyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, Wei Xu.
Artificial Neural Networks Applied to Taxi Destination Prediction by Alexandre de Brébisson, étienne Simon, Alex Auvolat, Pascal Vincent, Yoshua Bengio.
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering by Huijuan Xu, Kate Saenko.
Ask Me Anything: Dynamic Memory Networks for Natural Language Processing by Ankit Kumar, Ozan Irsoy, Jonathan Su, James Bradbury, Robert English, Brian Pierce, Peter Ondruska, Ishaan Gulrajani, Richard Socher.
Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources by Qi Wu, Peng Wang, Chunhua Shen, Anton van den Hengel, Anthony Dick.
Ask Your Neurons: A Neural-based Approach to Answering Questions about Images by Mateusz Malinowski, Marcus Rohrbach, Mario Fritz.
Associative Long Short-Term Memory by Ivo Danihelka, Greg Wayne, Benigno Uria, Nal Kalchbrenner, Alex Graves.
AttentionNet: Aggregating Weak Directions for Accurate Object Detection by Donggeun Yoo, Sunggyun Park, Joon-Young Lee, Anthony Paek, In So Kweon.
Attention-Based Models for Speech Recognition by Jan Chorowski, Dzmitry Bahdanau, Dmitriy Serdyuk, Kyunghyun Cho, Yoshua Bengio.
Attention to Scale: Scale-aware Semantic Image Segmentation by Liang-Chieh Chen, Yi Yang, Jiang Wang, Wei Xu, Alan L. Yuille.
Attention with Intention for a Neural Network Conversation Model by Kaisheng Yao, Geoffrey Zweig, Baolin Peng.
AtomNet: A Deep Convolutional Neural Network for Bioactivity Prediction in Structure-based Drug Discovery by Izhar Wallach, Michael Dzamba, Abraham Heifets.
七、數(shù)據(jù)集(共13項)
Caltech 101 by L. Fei-Fei, R. Fergus and P. Perona.
Caltech 256 by G. Griffin, AD. Holub, P. Perona.
CIFAR-10 by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton.
CIFAR-100 by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton.
The Comprehensive Cars (CompCars) dataset by Linjie Yang, Ping Luo, Chen Change Loy, Xiaoou Tang.
Flickr30k by Peter Young, Alice Lai, Micah Hodosh, Julia Hockenmaier.
ImageNet by Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg and Li Fei-Fei.
Microsoft COCO by Microsoft Research.
MNIST by Yann LeCun, Corinna Cortes, Christopher J.C. Burges.
Places by MIT Computer Science and Artificial Intelligence Laboratory.
STL-10 by Adam Coates, Honglak Lee, Andrew Y. Ng.
SVHN by Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, Andrew Y. Ng.
WWW Crowd Dataset by Jing Shao, Kai Kang, Chen Change Loy, and Xiaogang Wang.
八、關(guān)于學(xué)習(xí)深度學(xué)習(xí)的博客、訪談欄目等等(共4項)
Talking Machines hosted by Katherine Gorman and Ryan Adams.
Machine Learning & Computer Vision Talks by computervisiontalks.
How we’re teaching computers to understand pictures by Fei-Fei Li, Stanford University.
Deep Learning Community
九、亞馬遜提供的用于深度學(xué)習(xí)的公共AMI網(wǎng)絡(luò)服務(wù)(共3項)
DGYDLGPUv4 (ami-ba516ee8) [Based on g2.2xlarge]
DGYDLGPUXv1 (ami-52516e00) [Based on g2.8xlarge]
Caffe/CuDNN built 2015-05-04 (ami-763a331e) [For both g2.2xlarge and g2.8xlarge]
十、實用的深度神經(jīng)網(wǎng)絡(luò)—從GPU計算的角度來看(共26項)
Introduction
Python Platform for Scientific Computing
Theano Crash Course
Machine Learning Basics
Softmax Regression
Feedforward Neural Networks
Convolutional Neural Networks
Recurrent Neural Networks
Python Warm-up, pre-processing
Feedforward Layer
Softmax Regression
Multi Layer Perceptron Network
Feedforward Model
Auto-encoder
Convolutional Neural Networks
Recurrent Neural Networks
Telauges (10項)
A new deep learning library for learning DL.
MLP Layers: Tanh Layer, Sigmoid Layer, Identity Layer, ReLU Layer
Softmax Regression
ConvNet layers: Tanh Layer, Sigmoid Layer, Identity Layer, ReLU Layer
Max-Pooling layer
Max-Pooling same size
Feedforward Model
Auto-Encoder Model
SGD, Adagrad, Adadelta, RMSprop, Adam
Dropout