This is really neat! I have questions: “Needs tool usage” and “found the answer”...

nilsherzig · on April 3, 2024

Thanks :)

Die LLM makes these decisions on its own. If it writes a message which contains a tool call (Action: Web search Action Input: weight of a llama) the matching function will be executed and the response returned to the LLM. It's basically chatting with the tool.

You can toggle the log viewer on the top right, to get more detail on what it's doing and what is taking time. Timing depends on multiple things: - the size of the top n articles (generating embeddings for them takes some time) - the amount of matching vector DB responses (reading them takes some time)

dcreater · on April 4, 2024

> Die LLM

You mean the? The German is bleeding through haha

rzzzt · on April 4, 2024

Wolfenstein 3D did it first! And then The Simpsons as well.