I don't have anything against stitching together clips to tell your story, but I'm unconvinced that these demonstrate anything like that. As I said in another comment, it seems like you'd need to write a screenplay PLUS all the information the director, cinematographer, etc. use to create an actual movie -- everything from direction for how actors portray scenes to decisions on exactly how shots are constructed, to blocking for multiple actors in a scene, to color schemes...
There are a LOT of choices in making a movie, and if you just let the AI make them, you are getting "random" (uncontrolled) choices. I don't think that is going to compare favorably to the real thing.
If you can specify all that, then it's just a tool. Cool. But it's still going to take pro-level skills to use it.
If La Jetee was just some photos stitched together plus meaningful narration, then of course, you could use AI-generated photos.
But would AI be able to quote Vertigo, like La Jetee does? Doesn't art, at least to some degree, require intent (including all intentional subversions of that intent dogma, of course)?
If you disagree with that, you're basically saying La Jetee isn't art, which would be a hard sell.