UB Paderborn / Katalog / Suche / Details

Ergebnis 5 von 3097

IEEE transactions on systems, man, and cybernetics, 1988-05, Vol.18 (3), p.451-458

1988

Autor(en) / Beteiligte

Titel

epsilon -optimal discretized linear reward-penalty learning automata

Ist Teil von

IEEE transactions on systems, man, and cybernetics, 1988-05, Vol.18 (3), p.451-458

Ort / Verlag

New York, NY: IEEE

Erscheinungsjahr

1988

Link zum Volltext

Quelle

IEEE Xplore

Beschreibungen/Notizen

Variable-structure stochastic automata (VSSA) are considered which interact with an environment and which dynamically learn the optimal action that the environment offers. Like all VSSA the automata are fully defined by a set of action-probability updating rules. However, to minimize the requirements on the random-number generator used to implement the VSSA, and to increase the speed of convergence of the automation, the case in which the probability-updating functions can assume only a finite number of values. These values discretize the probability space (0, 1) and hence they are called discretized learning automata. The discretized automata are linear because the subintervals of (0, 1) are of equal length. The authors prove the following results: (a) two-action discretized linear reward-penalty automata are ergodic and epsilon -optimal in all environments whose minimum penalty probability is less than 0.5; (b) there exist discretized two-action linear reward-penalty automata that are ergodic and epsilon -optimal in all random environments, and (c) discretized two-action linear reward-penalty automata with artificially created absorbing barriers are epsilon -optimal in all random environments.< >

Sprache: Englisch
Identifikatoren: ISSN: 0018-9472
eISSN: 2168-2909
DOI: 10.1109/21.7494
Titel-ID: cdi_pascalfrancis_primary_7285797

Format: –
Schlagworte: Applied sciences, Automata. Logic controller, Automatic testing, Computer science, control theory, systems, Control theory. Systems, Convergence, Exact sciences and technology, Learning automata, Machine learning, Pattern recognition, Random number generation, Routing, Stochastic processes, System testing, Telephony

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX