LLMs
- AI · 1 min
Rio de Janeiro's Nex-N2 model merges existing LLMs for local use
Rio de Janeiro's Nex-N2 large language model (LLM) is a hybrid created by merging two existing models, Nex-N2_pro and Qwen, with respective weights of 0.6 and 0.4.
15 Jun, 12:07 am IST
- AI · 2 min
LLMs use tactical nukes in 95% of simulated nuclear crisis games
A recent study by Kenneth Payne examined how leading large language models (LLMs) handle simulated nuclear crisis scenarios, finding that they deploy tactical nuclear weapons in 95% of simulations.
12 Jun, 06:08 am IST
- AI · 2 min
Lathe uses LLMs to generate hands-on technical tutorials on demand
Lathe, a new tool launched on GitHub, leverages large language models (LLMs) to create multi-part, hands-on technical tutorials tailored to users learning new domains.
08 Jun, 12:08 am IST
- AI · 2 min
Researcher spends $1,500 testing if LLMs can hack vulnerable app
Kasra Rahjerdi, a security researcher, spent $1,500 to test whether large language models (LLMs) could exploit vulnerabilities in a deliberately insecure app he built.
04 Jun, 12:05 pm IST
- AI · 2 min
Five frontier LLMs disagree on 67% of 1k real-world fact-check claims
Five leading large language models (LLMs) disagreed on the verdict for 67% of 1,000 real-world fact-check claims, according to a study by Lenz Research (lenz.io).
28 May, 07:23 pm IST
- AI · 2 min
The last six months in LLMs in five minutes
The last six months have seen significant developments in large language models (LLMs), with a notable inflection point occurring in November 2025, particu…
19 May, 04:26 pm IST
- DEVTOOLS · OPEN SOURCE · 2 min
GitHub’s 181-star whichllm tool ranks local LLMs by real benchmarks, not parameters
A single-command Python tool helps users pick the best-performing local LLM for their hardware, using recency-aware benchmarks instead of parameter count.
15 May, 05:56 pm IST