Jeff Lockhart
Cat person. Sometimes sociologist of science, sex, & other stuff.
- Having looked at a lot of the grok posts, and just given how LLMs work, seems pretty clear they just added some propaganda text into the context window. You'd get the same answers if you put that in your prompt. I strongly doubt they changed the underlying LLM, just had the interface insert more txt
-
View full thread👀
- I s2g the amount of LLM "news" that is just people rediscovering 2022 era reddit jailbreaking tactics...
- So in that sense, this is no different from the other system prompting and such that always happen when the interface secretly changes the user input before it goes to an LLM, and then changes the LLM output before it returns to the user.
- Carl is of course correct here. What the bot says is unreliable for how it works, and also any proprietary LLM will have people making choices about the interface that guide outputs without user knowledge/understanding. This is just a really hamfisted example of what is always happening.
- [Not loaded yet]
- I don't think one farms, uh, traditional dairy from bulls.
- The extent to which "to evaluate the LLM, we asked an LLM" has been normalized as a research method is truly disheartening.
- (I cropped their name out because it's not about this one example. Leave the paper and authors in peace.)
- My little social scientist joy for the day: being sampled by NORC.
- [Not loaded yet]
- best spam call!
- [Not loaded yet]
- Or, at least, to something that looks like point B. Disney land has some pretty convincing castles, I hear.
- Your friendly neighborhood book.
- [This post was deleted]
- Condolences, but you're often on the discover feed. More now that I follow you, so I think it's different for everyone, but you were there for me even before I knew you.
- Every time I see it in writing, I get excited all over again.
- [Not loaded yet]
- Thank you!
- [Not loaded yet]
- Thank you!
- [Not loaded yet]
- the cv paper I didn't know I needed.
- [Not loaded yet]
- [Not loaded yet]
- Aww, thank you both!
- Congrats 🎉
- [Not loaded yet]
- [Not loaded yet]
- This whole world of open source tools generally don't need admin access to run them. Was mostly fine when only techy nerds tinkered with stuff the downloaded from GitHub, but genAI has brought a ton of newbies into that scene.
- [Not loaded yet]
- well behaved linguists rarely make history
- [Not loaded yet]
- Not now, unreasonable desire for neatly arranged fiber optic networking cables, I'm trying to read about pollution.
- Genuinely no idea how people use language models to clean up OCR results??? Just tried with ChatGPT's latest model and the results are both terrible and extremely prone to hallucination. This is from an English 1985 typewritten document with perfect lighting and pretty simple OCR errors?
-
View full threadAnd before anyone takes me to task for using it in the first place: I am writing a paper on OCR, and you have to compare stuff to LLM output now. Take it up with the reviewers.
- haaate that this is now essentially required, but it is. In a lot of domains where we had decent tools already.
- I fully got one version that was like 70% hallucinated. If you have to spend 40m tuning the prompt and *checking the output*, calling the model over and over again, or writing a script to chunk the input into nice little bites for it, it’s just not an improvement over the SOP.
- smh with this 'have to' rhetoric. These are machines born of the basest "don't wanna" impulse.
- Well, you see, they ask the chatbot if it did a good job and it tells them that yes it did.
- On brand: if you paste the name of a Nvidia product into the search bar on the Nvidia website, their chatbot writes a summary of the product specifications in real time. As if they don't have a whole product page for exactly that. (Which I had to scroll down to find because the LLM text was so long)
- A gay with altman's money should know... www.ft.com/content/b180...
- This isn't really true. The reaction to Nazi eugenics was key to its disavowal by the US academy. Frederick Osborn, in charge of the Morale Branch of the Department of War during WWII was also head of the American Eugenics Society, and he came out of the war newly opposed to *racial* eugenics...
- [Not loaded yet]
- This is so clear from even the first few glances at mid century American eugenics stuff. Frustrating when people believe otherwise. (But also always a fan of you plugging your much-more-than-a-glance work here!)
- [Not loaded yet]
- Devastating when it's someone you generally like and now you get to wonder if they're this wrong on everything and you just don't have the expertise to notice in the other areas.
- [Not loaded yet]
- New coal reef restoration plan.
- [Not loaded yet]
- Tragic discovery: my natural greys don't hold color nearly as well as the bleached hair.
- [This post could not be retrieved]
- I just use a password manager and save the answer as a random set of words. Kinda annoying because I have to remember which sites are real answers, but I'd never be able to remember this one anyway
- The correcthorsebatterystaple strategy is remarkably sound
- [Not loaded yet]
- [Not loaded yet]
- "I love trains," I say, sniffling because Amtrak can't get the support to love me back
- Proud of both of you.
- I love Donna so much.
- She actually likes her new carrier. It's a miracle.
- [Not loaded yet]
- I need to start a Neat Old Infrastructure collection with this, NYC subway switching, etc.
- [Not loaded yet]
- [Not loaded yet]
- It's excruciating but also very informative.
- Oh I'll look for the video. Silke posted it earlier I think. They clipped a speech he gave about the value of DEI and the country's history of racism, then spent 2 minutes saying "I don't know this man but based on that clip he's the devil"
- I generally try not to link to the bad place, but this clip is just too fun not to share. x.com/ByronDonalds...
- [Not loaded yet]
- After that fox segment? Yeah I bet any allies he has at UF pushed for this.
- [Not loaded yet]
- Warm toe beans
- Academic social science buddies: If you had about $12,000 you needed to spend on survey research before July 1, would you bank it at Lucid, which has a lower quality respondents but charges ~$1.50/complete, or one of the better vendors that charge ~$4.50/complete?
- [Not loaded yet]
- Quality > quantity here. If you want noisy biased proxies you can get them free.
- Wearing my best professor outfit today and the bus driver slays me with "hello young man."
- [Not loaded yet]
- 95% of the time that's a blessing.
- [Not loaded yet]
- When NYMag does it, sure. But when I do it on bsky surely it's legitimate.
- Oh is that all it costs? Hmm...
- It's good to know my increasingly grey hair isn't disqualifying tbh
- [Not loaded yet]
- Well, see, I wanted to look at a lot of chatgpt outputs, so I asked chatgpt to summarize them, and...
- [Not loaded yet]
- 🎶climb every mountain 🎶