😋 AGI (bark 🐶) Smart waitress 🎙️

adriens - May 31 '23 - Dev Community

❔ About

In this post you'll see how I built my first complete artwork, creating a bridge between:

  • 📜 Data
  • 🎙️ Sound design
  • 🤖 Generative Text To Speech
  • 🖌️ Video artwork
  • 📢 Digital content streamlining
  • 📈 Social networks and content embedding

💡 Inception

What triggered this creation is the following tweet:

... I immediately started to think:

"... what if I could create a fully digital, multimodal Customer Experience that would be ready to share on social platforms?"

☝️ Also, people talk a lot about generative AI tools like Midjourney or DALL-E, but far less about generative AI for TTS (Text to Speech).

♾️ "Voice prompts", aka "History prompts"

Like every other generative AI tool, suno-ai/bark is no exception: it relies on prompts.


Luckily, the bark community is very active and shares its voice prompt (and tag) discoveries.

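To make the idea of a voice prompt concrete, here is a minimal sketch, assuming the `bark` package is installed and that `generate_audio` accepts a `history_prompt` preset name as described in the project's README. The `VoiceLine` class, the `speak` helper and the choice of the `v2/en_speaker_9` preset are hypothetical names picked for illustration, not something taken from the original project.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class VoiceLine:
    """One line of dialogue plus the voice that should say it (hypothetical helper)."""

    text: str
    # Assumption: a speaker preset name from bark's bundled speaker library.
    voice: str = "v2/en_speaker_9"
    # Bracket tags shared by the community, e.g. "laughs" or "sighs".
    tags: List[str] = field(default_factory=list)

    def prompt(self) -> str:
        """Return the text prompt, with each tag rendered as a [tag] prefix."""
        prefix = " ".join(f"[{tag}]" for tag in self.tags)
        return f"{prefix} {self.text}".strip()


def speak(line: VoiceLine):
    """Synthesize one line with bark; returns the audio samples as an array."""
    # Imported lazily so the prompt helpers work without bark installed.
    from bark import generate_audio

    return generate_audio(line.prompt(), history_prompt=line.voice)
```

For example, `VoiceLine("Welcome to our restaurant!", tags=["laughs"]).prompt()` yields `[laughs] Welcome to our restaurant!`, which is the string that would be handed to bark together with the voice preset.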

🔁 Creative workflow

Here is the workflow I have experimented with so far:

  1. Create & release an SDK to get the data
  2. Imagine a customer experience at a restaurant
  3. Develop & tune the data-driven script and build the soundtrack
  4. Create an avatar and a scene for the waitress
  5. Combine the soundtrack & avatar into a video
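Step 3 of the workflow above, the data-driven script, could look roughly like the sketch below. The `MENU` records and the `waitress_script` function are entirely hypothetical: the post does not show what the SDK returns, so the data shape here is an assumption made only to illustrate turning records into short spoken lines.

```python
from typing import Dict, List, Union

# Hypothetical sample data: in the real workflow this would come from the SDK.
MENU: List[Dict[str, Union[str, float]]] = [
    {"dish": "grilled fish", "price": 18.5},
    {"dish": "coconut flan", "price": 6.0},
]


def waitress_script(menu, greeting: str = "Good evening and welcome!") -> List[str]:
    """Turn menu records into short sentences, one per text-to-speech call."""
    lines = [greeting, f"Tonight we have {len(menu)} specials."]
    for item in menu:
        lines.append(f"The {item['dish']} is {item['price']:.2f} euros.")
    lines.append("What can I get you?")
    return lines


if __name__ == "__main__":
    for sentence in waitress_script(MENU):
        print(sentence)
```

Keeping each sentence short is a deliberate choice in this sketch: every line can then be synthesized separately and re-generated on its own when the result does not sound right.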

🧰 Tools

Here are the open source tools I have used so far:

🍿 Demos

Below are the demos:

🤓 How it's built (author's words)

🎙️ Soundtrack

The soundtrack generated with bark:
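One way to assemble such a soundtrack is to synthesize the script sentence by sentence and join the clips with short pauses. The sketch below is an assumption about how this could be done, not the code used for the demo: `render_soundtrack`, `write_wav` and `bark_synth` are hypothetical helpers, and only `generate_audio`, `preload_models` and `history_prompt` are taken from bark's README.

```python
import struct
import wave
from typing import Callable, Iterable, List, Sequence

Samples = List[float]


def render_soundtrack(
    lines: Iterable[str],
    synth: Callable[[str], Sequence[float]],
    sample_rate: int,
    pause_s: float = 0.25,
) -> Samples:
    """Synthesize each line and append a short silence after it."""
    silence = [0.0] * int(sample_rate * pause_s)
    track: Samples = []
    for line in lines:
        track.extend(synth(line))
        track.extend(silence)
    return track


def write_wav(path: str, samples: Sequence[float], sample_rate: int) -> None:
    """Write mono float samples in [-1, 1] as a 16-bit PCM WAV file."""
    frames = b"".join(
        struct.pack("<h", int(max(-1.0, min(1.0, s)) * 32767)) for s in samples
    )
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(frames)


def bark_synth(voice: str = "v2/en_speaker_9") -> Callable[[str], Samples]:
    """Build a synth function backed by bark (assumes bark is installed)."""
    from bark import generate_audio, preload_models

    preload_models()  # downloads and loads the models on first use

    def synth(text: str) -> Samples:
        return generate_audio(text, history_prompt=voice).tolist()

    return synth
```

With bark installed, usage would be along the lines of `write_wav("soundtrack.wav", render_soundtrack(script, bark_synth(), SAMPLE_RATE), SAMPLE_RATE)`, where `SAMPLE_RATE` is imported from `bark` and `script` is a list of sentences. Passing the synthesizer in as a function keeps the assembly step independent of the text-to-speech tool, so another engine can be swapped in later.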

🎞️ Movie

Then I used the soundtrack to animate an avatar with SadTalker:
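For reference, this step can be scripted by calling SadTalker's `inference.py` from a local clone of the repository. The sketch below assumes the command-line flags documented in the SadTalker README at the time of writing (`--driven_audio`, `--source_image`, `--result_dir`, `--still`, `--enhancer`); the `sadtalker_command` and `animate` helpers, as well as the file names, are hypothetical.

```python
import subprocess
from typing import List, Optional


def sadtalker_command(
    audio: str,
    image: str,
    result_dir: str = "results",
    still: bool = False,
    enhancer: Optional[str] = None,
) -> List[str]:
    """Build the argument list for SadTalker's inference script."""
    cmd = [
        "python", "inference.py",
        "--driven_audio", str(audio),
        "--source_image", str(image),
        "--result_dir", str(result_dir),
    ]
    if still:
        cmd.append("--still")  # less head motion
    if enhancer:
        cmd += ["--enhancer", enhancer]  # e.g. "gfpgan" for face enhancement
    return cmd


def animate(audio: str, image: str, repo_dir: str, **options) -> None:
    """Run SadTalker from its repository folder; raises if the command fails."""
    # Relative paths are resolved against repo_dir, so absolute paths are safer.
    subprocess.run(sadtalker_command(audio, image, **options), cwd=repo_dir, check=True)
```

Building the command as a list, rather than a shell string, avoids quoting problems with file names and makes it easy to log or test the exact call before spending GPU time on it.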

🤔 Ideas for "later"

Automate:

  1. Video creation
  2. Video upload to dedicated cloud services for easier collaboration, digital marketing, etc.
  3. Avatar creation, so the video is entirely code-driven and each release gets more original (and funnier) content thanks to a one-time generative prompt (prompt design required)

↩️ Conclusion

The more I think about designing, and achieving, such experiences, the more evident it becomes that the core of this kind of project is to:

  • 🎯 Get a clear idea and stay strongly focused on what you want to achieve (i.e. so you don't get lost in your creative journey)
  • 🔗 Design a clean linear workflow that focuses on tasks (not tools), so you can adapt it easily as AI projects evolve at an amazing pace (new tools appear every week)

🔖 Resources

🔭 Tools to prototype
