    Georgios Tzannetos

    PhD Candidate in the field of Reinforcement Learning and Deep Learning.

    • Germany
    • MPI-SWS

    Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

    Published in International Conference on Machine Learning (ICML), 2024
