☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to

technology@hexbear.netEnglish · 11 days ago

An Open Source Conversation Response Path Exploration System using Monte Carlo Tree Search

9

24

An Open Source Conversation Response Path Exploration System using Monte Carlo Tree Search

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to

technology@hexbear.netEnglish · 11 days ago

9

GitHub - MVPandey/CAE: A fully functional LLM chat backend with FastAPI and Async operations, with a built in MCTS conversation analyzer

A fully functional LLM chat backend with FastAPI and Async operations, with a built in MCTS conversation analyzer - MVPandey/CAE

Instead of just generating the next response, it simulates entire conversation trees to find paths that achieve long-term goals.

How it works:

Generates multiple response candidates at each conversation state
Simulates how conversations might unfold down each branch (using the LLM to predict user responses)
Scores each trajectory on metrics like empathy, goal achievement, coherence
Uses MCTS with UCB1 to efficiently explore the most promising paths
Selects the response that leads to the best expected outcome

Limitations:

Scoring is done by the same LLM that generates responses
Branch pruning is naive - just threshold-based instead of something smarter like progressive widening
Memory usage grows with tree size, there currently no node recycling

Chat

Owl [he/him]@hexbear.net
link
fedilink
English
arrow-up
7·
11 days ago
Damnit, I saw MCTS and thought it’d be something neat, but then it’s LLMs because of course every piece of tech news is LLMs.
- Skye [she/her, they/them]@hexbear.net
  link
  fedilink
  English
  arrow-up
  3·
  11 days ago
  Making the LLM use an LLM to figure out what to say almost feels like a pretty good tech news shitpost

technology@hexbear.net

technology@hexbear.net

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@hexbear.net

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

Rules:

1. Obviously abide by the sitewide code of conduct. Bigotry will be met with an immediate ban
2. This community is about technology. Offtopic is permitted as long as it is kept in the comment sections
3. Although this is not /c/libre, FOSS related posting is tolerated, and even welcome in the case of effort posts
4. We believe technology should be liberating. As such, avoid promoting proprietary and/or bourgeois technology
5. Explanatory posts to correct the potential mistakes a comrade made in a post of their own are allowed, as long as they remain respectful
6. No crypto (Bitcoin, NFT, etc.) speculation, unless it is purely informative and not too cringe
7. Absolutely no tech bro shit. If you have a good opinion of Silicon Valley billionaires please manifest yourself so we can ban you.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

228 users / day
706 users / week
1.15K users / month
1.16K users / 6 months
1 local subscriber
23.9K subscribers
215 Posts
905 Comments
Modlog