FB 6 Mathematik/Informatik/Physik

Institut für Mathematik


Navigation und Suche der Universität Osnabrück


Hauptinhalt

Topinformationen

Begleitseminar zum Studienprojekt EBiMAS: Emergent behaviors in a multi-agent system with reinforcement learning (Part V)

8.3579

Dozenten

Beschreibung

In the EBiMAS study project, you will collaborate as a team on developing and training a multi-agent system with a 3D simulation framework that we’ve developed based on Google DeepMind’s MuJoCo physics engine. In the scenario you will work on, simple artificial ant agents must solve a puzzle-foraging task, potentially learning to communicate with each other in the process. The agents are trained with advanced Deep Reinforcement Learning algorithms to investigate emergent properties and dynamics between the agents, such as language, cooperation, and strategies.

Prerequisites for this course include proficiency in programming (Python) and either basic experience with simulation tools like Blender, MuJoCo, Nvidia Isaac, or foundational knowledge of Machine Learning (ML) / Reinforcement Learning (RL). Applicants for the course are asked to submit a brief essay before the start of the semester, introducing themselves, their background, relevant courses and experiences, as well as highlighting their motivation for participating in the project (mail as PDF to julius.mayer@uni-osnabrueck.de). This will help us better understand each candidate and determine the best fit for the program.

Full-team milestone meetings with the supervisors are held bi-weekly on Wednesdays. Specific times may be adjusted based on group availability. Students are expected to self-organize as a team 2-3 times per week, with meeting times for group work agreed upon among students and subgroups.

Note: This course is open to students starting their first part of a two-semester study project in WS24 and those continuing from the previous semester. There is also a separate one-semester interdisciplinary seminar that accompanies the study project.

Weitere Angaben

Ort: nicht angegeben
Zeiten: Mi. 16:30 - 18:30 (wöchentlich) - There will be bi-weekly team meetings for updates and planning with supervisors. Students are expected to self-organize and meet regularily in their work groups 2-3 times per week. Meeting times may adjust based on group availability.
Erster Termin: Mittwoch, 30.10.2024 16:30 - 18:30
Veranstaltungsart: Seminar (Offizielle Lehrveranstaltungen)

Studienbereiche

  • Cognitive Science > Master-Programm
  • Human Sciences (e.g. Cognitive Science, Psychology)

Past and Forthcoming Events

Publications

  • Asymptotics of a time-bounded cylinder model, with N. Aschenbruck and S. Bussmann, Probability in the Engineering and Informational Sciences, https://doi.org/10.1017/S0269964822000420
  • The method of cumulants for the normal approximation, with S. Jansen and K. Schubert, Probability Surveys 2022, Vol. 19, 185-270, https://doi.org/10.1214/22-PS7
  • Sedentary Random Waypoint, with C. Betken, arXiv:2009.02941
  • The Impact of Bit Errors on Intra-Session Network Coding with Heterogeneous Packet Lengths, with B. Schütz, N. Aschenbruck, S. Bussmann and M. Juhnke-Kubitzke, Proc. of the 45th IEEE LCN Symposium on Emerging Topics in Networking LCN, virtually hosted in Sydney, Australia, Nov. 16–19, 2020.
  • Stationarity for the Small World in Motion Mobility Model, with Nils Aschenbruck, Christian Heiden und Matthias Schwamborn, MSWIM '19: Proceedings of the 22nd International ACM Conference on Modeling, Analysis and Simulation of Wireless and Mobile Systems, Nov 25-29, 2019, https://doi.org/10.1145/3345768.3355935
  • Crossing Numbers and Stress of Random Graphs, with Markus Chimani and Matthias Reitzner, In Proceedings 26th International Symposium, GD 2018, Barcelona, Spain, 255--268, 2018 available here and for an extended journal version here: https://arxiv.org/abs/1808.07558
  • Fluctuations in a general preferential attachment model via Stein's method, with Carina Betken and Marcel Ortgiese, Random Structures & algorithms, vol.55, no.4, 2019 available here
  • Connection times in large ad-hoc mobile networks, Bernoulli, vol.22, no.4, 2143--2176, 2016 available here
    with Gabriel Faraud, Wolfgang König
  • The random disc thrower problem, Proceedings of the 90th European Study Group Mathematics with Industry, 59-78, 2013  available here with T. van der Aalst, D. Denteneer, M. Hong Duong, R. J. Kang, M. Keane, J. Kool, I. Kryven, T. Meyfroyt, T. Müller, G. Regts, J. Tomczyk
  • Edge fluctuations of eigenvalues of Wigner matrices, High Dimensional Probability VI: the Banff volume, Progress in Probability, vol.66, 261-275, Springer, Basel, 2013 available here
    with Peter Eichelsbacher
  • Moderate deviations for the determinant of Wigner matrices, Dedicated to Friedrich Götze on the occasion of his sixtieth birthday, Limit Theorems in Probability, Statistics and Number Theory, Springer Proceedings in Mathematics & Statistics, vol.42, 253-275, 2013, available here
    with Peter Eichelsbacher
  • Moderate deviations for the eigenvalue counting function of Wigner matrices, ALEA, Lat. Am. J. Probab. Math. Stat. 10 (1), 27-44, 2013, available here
    with Peter Eichelsbacher
  • Moderate deviations via cumulants, Journal of Theor. Probability, 2012, available here
    with Peter Eichelsbacher
  • Moments of recurrence times for Markov chains, Electronic Comm. Probab., 16(28), 296-303, 2011, available here
    with Frank Aurzada, Marcel Ortgiese, Michael Scheutzow
  • Moderate deviations in a random graph and for the spectrum of Bernoulli random matrices, Electronic Journal of Probability, Vol. 14, Paper no. 92, 2636-2656, 2009, available here
    with Peter Eichelsbacher
  • Perpendicular transport of charged particles in slab turbulence: recovery of diffusion for realistic wave-spectra?, Journal of Physics G: Nuclear and Particle Physics, 35, 025202, 2008
    with Andreas Shalchi
  • Velocity correlation functions of charged test particles, Journal of Physics G: Nuclear and Particle Physics, 34, 859, 2007
    with Andreas Shalchi