ALT 2011 Table of Contents

Editors' Introduction		1 - 13
Jyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, and Thomas Zeugmann


INVITED PAPERS

Models for Autonomously Motivated Exploration in Reinforcement Learning, Abstract.		14 - 17
Peter Auer, Shiau Hong Lim, and Chris Watkins.
On the Expressive Power of Deep Architectures, Abstract.		18 - 36
Yoshua Bengio and Olivier Delalleau.
Optimal Estimation , Abstract.		37 - 37
Jorma Rissanen.
Learning from Label Preferences, Abstract.		38 - 38
Eyke Hüllermeier and Johannes Fürnkranz.
Information Distance and Its Extensions, Abstract.		39 - 39
Ming Li.


REGULAR CONTRIBUTIONS




Inductive Inference

Iterative Learning from Positive Data and Counters, Abstract.		40 - 54
Timo Kötzing.
Robust Learning of Automatic Classes of Languages, Abstract.		55 - 69
Sanjay Jain, Eric Martin, and Frank Stephan.
Learning and Classifying, Abstract.		70 - 83
Sanjay Jain, Eric Martin, and Frank Stephan.
Learning Relational Patterns, Abstract.		84 - 98
Michael Geilke and Sandra Zilles.


Regression


Adaptive and Optimal Online Linear Regression on ℓ¹-Balls, Abstract.		99 - 113
Sébastien Gerchinovitz and Jia Yuan Yu
Re-adapting the Regularization of Weights for Non-stationary Regression, Abstract.		114 - 128
Nina Vaits, and Koby Crammer.
Competing against the Best Nearest Neighbor Filter in Regression, Abstract.		129 - 143
Arnak S. Dalalyan, and Joseph Salmon.


Bandit Problems


Lipschitz Bandits without the Lipschitz Constant, Abstract.		144 - 158
Sébastien Bubeck, Gilles Stoltz, and Jia Yuan Yu.
Deviations of Stochastic Bandit Regret, Abstract.		159 - 173
Antoine Salomon and Jean-Yves Audibert.
On Upper-Confidence Bound Policies for Switching Bandit Problems, Abstract.		174 - 188
Aurélien Garivier, and Eric Moulines.
Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits, Abstract.		189 - 203
Alexandra Carpentier, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos, and Peter Auer.


Online Learning

The Perceptron with Dynamic Margin, Abstract.		204 - 218
Constantinos Panagiotakopoulos and Petroula Tsampouka.
Combining Initial Segments of Lists, Abstract.		219 - 233
Manfred K. Warmuth, Wouter M. Koolen, and David P. Helmbold.
Regret Minimization Algorithms for Pricing Lookback Options, Abstract.		234 - 248
Eyal Gofer and Yishay Mansour.
Making Online Decisions with Bounded Memory, Abstract.		249 - 261
Chi-Jen Lu and Wei-Fu Lu.
Universal Prediction of Selected Bits, Abstract.		262 - 276
Tor Lattimore, Marcus Hutter, and Vaibhav Gavane.
Semantic Communication for Simple Goals Is Equivalent to On-line Learning, Abstract.		277 - 291
Brendan Juba, and Santosh Vempala.


Kernel and Margin Based Methods

Accelerated Training of Max-Margin Markov Networks with Kernels, Abstract.		292 - 307
Xinhua Zhang, Ankan Saha, and S. V. N. Vishwanathan.
Domain Adaptation in Regression, Abstract.		308 - 323
Corinna Cortes and Mehryar Mohri.
Approximate Reduction from AUC Maximization to 1-Norm Soft Margin Optimization, Abstract.		324 - 337
Daiki Suehiro, Kohei Hatano, and Eiji Takimoto.


Intelligent Agents

Axioms for Rational Reinforcement Learning, Abstract.		338 - 352
Peter Sunehag and Marcus Hutter.
Universal Knowledge-Seeking Agents, Abstract.		353 - 367
Laurent Orseau.
Asymptotically Optimal Agents, Abstract.		368 - 382
Tor Lattimore, and Marcus Hutter.
Time Consistent Discounting, Abstract.		383 - 397
Tor Lattimore, and Marcus Hutter.


Other Learning Models

Distributional Learning of Simple Context-Free Tree Grammars, Abstract.		398 - 412
Anna Kasprzik and Ryo Yoshinaka.
On Noise-Tolerant Learning of Sparse Parities and Related Problems, Abstract.		413 - 424
Elena Grigorescu, Lev Reyzin, and Santosh Vempala.
Supervised Learning and Co-training, Abstract.		425 - 439
Malte Darnstädt, Hans Ulrich Simon, and Balázs Szörényi.
Learning a Classifier when the Labeling Is Known, Abstract.		440 - 451
Shalev Ben David and Shai Ben-David.


Erratum

Erratum: Learning without Coding, Abstract.		452 - 452
Samuel E. Moelius III and Sandra Zilles.
Author Index		453

©Copyright Notice:
The document of this page is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provision of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer-Verlag. Violations are liable for prosecution under German Copyright Law.

back to the ALT 2011 Proceedings Page

uparrow uparrow back to the Conference Page

Table of Contents

Inductive Inference

Regression

Bandit Problems

Online Learning

Kernel and Margin Based Methods

Intelligent Agents

Other Learning Models

Erratum