Conversational Audio AIsGemini’s multi-model Live and elevan labs conversational AI both can take audio as input and generate audio as response.Dec 27, 2024Dec 27, 2024
Gemini2 Multimodal Live API — Quick LookWorks on Websocket. Each Websocket connection is a unique session.Dec 20, 2024Dec 20, 2024
Product Market Fit Through Iterative User InteractionsA new user interface (similar to touch for mobile or game pad for consoles) enables a new way of thinking and building user interactions.Jun 15, 2024Jun 15, 2024
ML/Deep Learning As A New User InterfaceA unique user interface (Keyboard, Mouse, Touch, Multi-Touch, VR gestures, Joy stick, Game Pad) brings a new way of thinking & building…Jun 12, 2024Jun 12, 2024
Flutter Layout and ConstraintsFlutter layout works on the principle of “Constraints go down. Sizes go up. Parent sets position” (See…May 4, 2024May 4, 2024
Venture ScoutsRecetly I have contributed to an article on “How to break into the VC industry?”. Sharing the same here.Dec 31, 2023Dec 31, 2023
GenAI & Declarative User InterfacesDiscussion on imperative & declarative user interfacesSep 25, 2023Sep 25, 2023
Real time video switchI am programming a realtime video switch that allows one choosen camera feed to be pushed to the users among list of live video feeds from…May 20, 2023May 20, 2023
Text embedding & indexing similarities for fast comparisonText embeddings help us measure the relatedness of (paragraph of) texts in the context of a LLM.May 20, 2023May 20, 2023