This week:
- Using ChatGPT for UX research - experimental results
- More competitions and benchmarks for AI progress - the AI Mathematical Olympiad: "this competition uses a dataset of 110 novel math problems… from simple arithmetic to algebraic thinking and geometric reasoning"
- Podcast of Ethan Mollick and Ezra Klein on How to Use AI right now. "Mollick says it’s helpful to understand this moment as one of co-creation, in which we all should be trying to make sense of what this technology is going to mean for us."
- Evidence of extensive ChatGPT/LLM-derived text appearing in scholarly papers. One paper finds signals and evidence of GPT use; another, covering medical papers, is here.
- Large language models, explained with a minimum of math and jargon
- "The 18 most interesting startups from YC’s Demo Day show we’re in an AI bubble"
- The 2024 MAD (ML, AI & Data) Landscape
- Biased source, but "16 Changes to the Way Enterprises Are Building and Buying Generative AI". I think the most useful signal here is that the majority of use cases are internal-facing (not customer-facing) for now.
- In this paper on a new evaluation benchmark for financial questions, there is evidence that giving the context before asking the question returns better results (see the short sketch after this list). "Showing the relevant context (i.e., filing or evidence extract) before the question leads to significant performance improvements over showing the context after the question."
- "In 1979, IBM—like all technology businesses—had a meeting about future technology, both hypothetical and real…. Now, a single photograph from this meeting's presentation packet has resurfaced after years of circulating the internet, ready for its moment nearly 45 years later."
- Fact-checking via LLM seems to outperform crowdsourced human annotators… and at roughly 20x lower cost.
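
A minimal sketch of what that context-ordering finding looks like in practice. The `build_prompt` helper, template wording, and toy filing extract below are hypothetical illustrations, not from the paper; only the ordering idea (context before vs. after the question) comes from the source.

```python
# Hypothetical sketch of the two prompt orderings compared in the paper:
# context-before-question vs. question-before-context.

def build_prompt(context: str, question: str, context_first: bool = True) -> str:
    """Assemble a single prompt string; only the ordering of the blocks differs."""
    context_block = f"Context (filing / evidence extract):\n{context}"
    question_block = f"Question:\n{question}"
    parts = [context_block, question_block] if context_first else [question_block, context_block]
    return "\n\n".join(parts) + "\n\nAnswer:"

if __name__ == "__main__":
    # Toy example, not taken from the benchmark.
    context = "Revenue for FY2023 was $4.2B, up 12% year over year."
    question = "What was the year-over-year revenue growth in FY2023?"

    # Per the paper, the first variant (context shown before the question)
    # yields significantly better answers than the second.
    print(build_prompt(context, question, context_first=True))
    print("---")
    print(build_prompt(context, question, context_first=False))
```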