News Score: Score the News, Sort the News, Rewrite the Headlines

The First Fully General Computer Action Model

We designed FDM-1, a foundation model for computer use. FDM-1 is trained on videos from a portion of our 11-million-hour screen recording dataset, which we labeled using an inverse dynamics model that we trained. Our video encoder can compress almost 2 hours of 30 FPS video into only 1M tokens. FDM-1 is the first model with the long-context training needed to become a coworker for CAD, finance, engineering, and eventually ML research, and it consistently improves with scale. It trains and infers d...
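To put the compression figure in perspective, here is a rough back-of-the-envelope sketch. It is not from the announcement: it assumes "almost 2 hours" means exactly 2 hours and "1M" means exactly 1,000,000 tokens, and the helper names are made up for illustration. Since the real duration is a little under 2 hours, the result is best read as an approximate lower bound on tokens per frame.

```python
# Back-of-the-envelope check of the stated compression figure.
# Assumptions (not from the source): "almost 2 hours" is rounded up to
# exactly 2 hours, and "1M tokens" is exactly 1,000,000 tokens.
FPS = 30
HOURS = 2
TOKEN_BUDGET = 1_000_000


def frames_in(hours: float, fps: int) -> int:
    """Number of video frames in `hours` of footage at `fps` frames per second."""
    return int(hours * 3600 * fps)


def tokens_per_frame(budget: int, hours: float, fps: int) -> float:
    """Average token budget available per frame."""
    return budget / frames_in(hours, fps)


if __name__ == "__main__":
    frames = frames_in(HOURS, FPS)
    average = tokens_per_frame(TOKEN_BUDGET, HOURS, FPS)
    print(f"{frames} frames, about {average:.2f} tokens per frame")
```

Under these assumptions, 2 hours of 30 FPS video is 216,000 frames, so the encoder would be spending roughly 4.6 tokens per frame on average.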

Read more at si.inc
