thanks! the math and architecture of the FDM (no video encoder) is pretty simple...

		nee1r 35 days ago \| parent \| context \| favorite \| on: The First Fully General Computer Action Model thanks! the math and architecture of the FDM (no video encoder) is pretty simple, its a regular transformer with next-token predictions but with frames interleaved.