Thus far, generating video from text has been as bit clunky. It was hard to maintain character consistency, and it was hard ...