While voice AI greatly facilitates communication, it is a little restrictive.
Let's say you want your voice chatbot to share a YouTube link with your users. Voice alone is useless here, because spelling out a link aloud doesn't make sense. This dependence on audio alone narrows its range of capabilities. However, when integrated with visual elements, it becomes far more practical and can be used more broadly in business operations.
For example, suppose I tell it by voice that I want to learn more about a company's recent tech event. The AI shares a YouTube video of the event with me and asks me to have a look at it. Something like that, which supports engagement and communication through audio and visual elements together, felt like an interesting idea to me.
And hence I built Vani.