jdiff · 2025-12-26T15:49:17 1766764157

One additional bit of context: they provided guidelines and instructions specifically to send emails and verify their successful delivery, so that the "random act of kindness" could be properly reported and measured at the end of this experiment.

twoodfin · 2025-12-26T18:18:20 1766773100

I think the key misalignment here is whether the output of an appropriately prompted LLM can ever be considered an “act of kindness”.

mckn1ght · 2025-12-26T19:06:25 1766775985

At least in this case, it’s indeed quite Orwellian.

jdiff · 2025-12-26T15:33:37 1766763217

Odd response to attaching additional, valuable information to an existing comment.

jdiff · 2025-12-24T01:57:20 1766541440

Copying and pasting doesn't work unless your PDF viewer does OCR. And if the redaction is just a black rectangle overlaid on top, that can still be removed.

jdiff · 2025-12-20T16:07:34 1766246854

Of course it is. It's not capable of actually forgetting or suppressing its training data. It's just double checking rather than assuming because of the prompt. Roleplaying is exactly what it's doing. At any point, it may stop doing that and spit out an answer solely based on training data.

It's a big part of why search overview summaries are so awful. Many times the answers are not grounded in the material.

wavemode · 2025-12-20T21:18:13 1766265493

It may actually have the opposite effect - the instruction to not use prior knowledge may have been what caused Gemini 3 to assume incorrect details about how certain puzzles worked and get itself stuck for hours. It knew the right answer (from some game walkthrough in its training data), but intentionally went in a different direction in order to pretend that it didn't know. So, paradoxically, the results of the test end up worse than if the model truly didn't know.

jdiff · 2025-12-19T12:25:10 1766147110

That initial percentage is a little misleading. It includes everything that caniuse isn't sure about. Really it should be something like 97.5±2.5 but the issue's been stalled for years.

Even the absolute most basic features that have been well supported for 30 years, like the HTML "div" element, cap out at 96%. Change the drop-down from "all users" to "all tracked" and you'll get a more representative answer.

jdiff · 2025-12-13T13:45:09 1765633509

As opposed to the DisplayPort cable, DisplayPort standard, or DisplayPort encoding that's sent over the wire, yes. This isn't a PIN number situation despite the stutter.

jdiff · 2025-12-12T12:19:32 1765541972

A lot of spam uses Unicode, either for non-English languages or just to swap in lookalike characters to try and dodge keyword filters.
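
To make the lookalike trick concrete, here's a rough sketch of how a keyword filter might fold some of those characters before matching. The names `fold_compat` and `contains_keyword` are made up for illustration, and this is an assumption about how such a filter could work, not a description of any particular one. It only uses NFKC normalization from Python's standard library, which handles fullwidth and math-styled letters but not cross-script lookalikes such as Cyrillic letters that resemble Latin ones.

```python
import unicodedata


def fold_compat(text: str) -> str:
    """Fold compatibility characters (fullwidth, math-styled letters) and case.

    NFKC maps compatibility forms to their plain equivalents. It does NOT map
    letters from other scripts (e.g. Cyrillic) onto Latin lookalikes.
    """
    return unicodedata.normalize("NFKC", text).casefold()


def contains_keyword(text: str, keywords: list[str]) -> bool:
    """Hypothetical helper: True if any keyword appears in the folded text."""
    folded = fold_compat(text)
    return any(keyword in folded for keyword in keywords)


if __name__ == "__main__":
    blocked = ["free"]
    fullwidth = "\uff26\uff32\uff25\uff25 prize"  # fullwidth Latin capitals
    cyrillic = "fr\u0435\u0435 prize"              # Cyrillic small letter ie, twice
    print(contains_keyword(fullwidth, blocked))
    print(contains_keyword(cyrillic, blocked))
```

The fullwidth version gets caught after folding, while the Cyrillic swap still slips through. Catching that second kind would need a separate confusables mapping, which this sketch deliberately leaves out.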

jdiff · 2025-12-12T04:59:20 1765515560

The consistent side comments about the interface to Gemini being "half baked" probably don't fit into that narrative.

jdiff · 2025-12-02T19:18:23 1764703103

"Use AI to fix AI" is not my interpretation of the technique. I may be overlooking it, but I don't see any hint that this soul doc is AI generated, AI tuned, or AI influenced.

Separately, I'm not sure Sam's word should be held as prophetic and unbreakable. It didn't work for his company, at some previous time, with their approaches. Sam's also been known to tell quite a few tall tales, usually about GPT's capabilities, but tall tales regardless.

jdiff · 2025-12-01T22:13:48 1764627228

That's true for UI, but it's not true when you're arbitrarily injecting user feedback into a dynamic system where you do not know how the dominoes will be affected as they fall.

wat10000 · 2025-12-01T23:50:39 1764633039

I wouldn’t call those dark patterns.