Brandon Rohrer on Nostr: Personal project update: Myrtle is a workbench for running reinforcement learning ...
Personal project update: Myrtle is a workbench for running reinforcement learning algorithms against real-time environments (including physical robots). Version 0.0.5 is out and has a snappy demo.
What’s different about Myrtle?
It allows for multiple, intermittent reward channels. This is to allow for ongoing human feedback, but not to read anything into it when the human gets distracted or steps out for a coffee.
Published at
2024-06-08 23:22:36Event JSON
{
"id": "89f8b38cb13917139b67fd513965f3cfba5bbd017f332bc6228978362de2be79",
"pubkey": "95ea081a627cee44e532825986ecc662139d068c4bdacbe820a8f445b9c6c06b",
"created_at": 1717888956,
"kind": 1,
"tags": [
[
"imeta",
"url https://cdn.masto.host/recsyssocial/media_attachments/files/112/583/533/184/204/595/original/20bbd6a47a7104d3.png",
"m image/png"
],
[
"proxy",
"https://recsys.social/@brohrer/112583570681811299",
"web"
],
[
"proxy",
"https://recsys.social/users/brohrer/statuses/112583570681811299",
"activitypub"
],
[
"L",
"pink.momostr"
],
[
"l",
"pink.momostr.activitypub:https://recsys.social/users/brohrer/statuses/112583570681811299",
"pink.momostr"
]
],
"content": "Personal project update: Myrtle is a workbench for running reinforcement learning algorithms against real-time environments (including physical robots). Version 0.0.5 is out and has a snappy demo. \n\nWhat’s different about Myrtle?\nIt allows for multiple, intermittent reward channels. This is to allow for ongoing human feedback, but not to read anything into it when the human gets distracted or steps out for a coffee.\nhttps://cdn.masto.host/recsyssocial/media_attachments/files/112/583/533/184/204/595/original/20bbd6a47a7104d3.png\n",
"sig": "17dd17de778ad803252e39699ba04b08f608715becc42d9763af2f3794dd84363620828b7e5c8aad507d1bf7a4bd4828f703bed06e968e2a284752db06749204"
}