They did show some example-videos in the article. From the Beatles-video, this looks quite a few steps away from working well. When they change camera, walls etc. suddenly changes colors, as the images are processed separately.
Would it be require a lot of changes to repeat the training process mentioned in the article with video instead of stills? Aside from the increase in data being processed.