I'd like to contribute with the ability to include inline assets (such as images, youtube videos, etc). Is the level of effort on this unrealistically high? I see that the data model focuses on individuals characters syncing over Swarm, but has the door been left open for non-text elements?