melvincarvalho on Nostr: Claude 3.7 Sonnet is phenomenal. It scores 70% on SWE-Bench which is some of the ...
Claude 3.7 Sonnet is phenomenal. It scores 70% on SWE-Bench which is some of the hardest coding problems around. To put into context 20% was state of the art in open source a few months ago. Elon made a huge mistake not open sourcing grok 3, or even grok 2. Their massive data centers are already ancient history. FOSS is the new competitive advantage.
Published at
2025-02-24 19:42:04Event JSON
{
"id": "a4dfef4cd63a508b047dd973911841dd53473579d9b314f56e64cf0dc5866a7e",
"pubkey": "de7ecd1e2976a6adb2ffa5f4db81a7d812c8bb6698aa00dcf1e76adb55efd645",
"created_at": 1740426124,
"kind": 1,
"tags": [
[
"imeta",
"url https://media.ditto.pub/69f263c666271a5b469cc5b6dc67735ee3a669a1dc6e2e11e90f9699b807ac62.png",
"m image/png",
"x 69f263c666271a5b469cc5b6dc67735ee3a669a1dc6e2e11e90f9699b807ac62",
"size 215313",
"dim 961x737",
"blurhash U1S$ou4Tt0.TV?%M%M%MD$%2M|oz%Kt7WBxa"
]
],
"content": "Claude 3.7 Sonnet is phenomenal. It scores 70% on SWE-Bench which is some of the hardest coding problems around. To put into context 20% was state of the art in open source a few months ago. Elon made a huge mistake not open sourcing grok 3, or even grok 2. Their massive data centers are already ancient history. FOSS is the new competitive advantage.\n\nhttps://media.ditto.pub/69f263c666271a5b469cc5b6dc67735ee3a669a1dc6e2e11e90f9699b807ac62.png",
"sig": "b40e8d1bdeec6ff5287f99943b038b72a265d080055e4cf5a8b5007f688d0ba8bf76e64dfad5738a871d1391878cd010aac8870ebb1d65ac1db9f8bf73c6f12f"
}