Talking about Alpha Evolve https://deepmind.google/discover/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/ ?
First, Microsoft isn’t using this yet but even if they were it doesn’t work in this context. What Google did was they wrote a fitness function to tune the Generative process. Why not have some rubric that scores the code as our fitness function? Because the function needs to be continuous for this to work well, no sudden cliffs. But also they didn’t address how this would work in a multi-objective space, this technique doesn’t let the LLM make reasonable trade offs between complexity and speed.
Don’t even need trials apparently.