DuckDuckGo offers “anonymous” access to AI chatbots through new service

nifty@lemmy.world · 1 year ago

DuckDuckGo offers “anonymous” access to AI chatbots through new service

tonyn@lemmy.ml · 1 year ago

How is the compute getting paid for?

Ghostalmedia@lemmy.world · 1 year ago

DDG makes money through ads and affiliate programs.

/home/pineapplelover@lemm.ee · 1 year ago

Oh yeah I might have to tell ublock to whitelist ddg so I can support them through ads

brbposting@sh.itjust.works · 1 year ago

Couple good points in the comments -

Using LLMs to avoid the blank page problem:

For AI, bring your own data:

demonsword@lemmy.world · 1 year ago

Ars Technica forums are alright, I usually take a look there whenever I read something on their site

Kecessa@sh.itjust.works · 1 year ago

Anonymous or not, you’re still feeding it data

just_another_person@lemmy.world · 1 year ago

Not how that works.

metallic_substance@lemmy.world · 1 year ago

I’m curious, how does it work?

RagingRobot@lemmy.world · 1 year ago

Not who you asked but you don’t want your AI to train itself based on the questions random users ask because it could introduce incorrect or offensive information. For this reason llms are usually trained and used in a separate step. If a user gave the llms private information you wouldn’t want it to learn that information and pass it on to other users so there are protections in place usually to stop it from learning new things while just processing requests.

Lung@lemmy.world · 1 year ago

These companies absolutely collect the prompt data and user session behavior. Who knows what kinda analytics they can use it for at any time in the future, even if it’s just assessing how happy the user was with the answers based on response. But having it detached from your person is good. Unless they can identify you based on metrics like time of day, speech patterns, etc

just_another_person@lemmy.world · edit-2 1 year ago

Prompt data is pointless and useless without a human to create a feedback loop for it, at which point it wouldn’t have context anyway. Also human effort to correct spelling dnd other user errors at the outset anyway. Hugely pointless and unreliable.

Not to mention, what good would it do for training? It wouldn’t help the model at all.

Lung@lemmy.world · 1 year ago

You can collect the data and figure out how to use it later. Just look at the Google leaks lately and what they collect, it’s literally everything down to the length of clicks and full walks through the site

Collecting data about user interests is in itself valuable, and it’s plausible to use various metrics to analyze it, something as simple as sentiment analysis, which has been broadly done. Sentiment analysis has predated modern ML by a long margin, but you can read the wiki page on that

But yeah just think about stuff like Google trends, tracking interest in topics, as an example of what such data could be used for. And deanonymizing the inputs is probably possible to some degree, aside from the obvious trust we place in DDG as a centralized failure point

just_another_person@lemmy.world · 1 year ago

You’re confusing analytics with direct input storage and reuse of prompt data to train somehow, as in your original comment.

Analytics has absolutely nothing to do with their model usage and training, and would pointless. Observing keywords and interests is standard analysis stuff. I don’t even think anyone even cares about it anymore.

Evotech@lemmy.world · 1 year ago

Not really. Depending on the implementation.

It’s not like ddg is going to keep training their own version of llama or mistral

regrub@lemmy.world · edit-2 7 months ago

deleted by creator

Evotech@lemmy.world · 1 year ago

But these open models don’t really take new input into their models at any point. They don’t normally do that type of inference training.

regrub@lemmy.world · edit-2 7 months ago

deleted by creator

Evotech@lemmy.world · 1 year ago

It’s true. But I trust them more than closedai or Ms at least

shotgun_crab@lemmy.world · 1 year ago

But that’s a human error as you said, the only way to fix it is by using it correctly as an user. AI is a tool and it should be handled correctly like any other tool, be it a knife, a car, a password manager, a video recording program, a bank app or whatever.

I think a bigger issue here is that many people don’t care about their personal information as much as their lives.

subtext@lemmy.world · edit-2 1 year ago

https://duckduckgo.com/duckduckgo-help-pages/aichat/ai-chat-privacy/

your conversations are not used to train chat models by DuckDuckGo or the underlying model providers

Even_Adder@lemmy.dbzer0.com · 1 year ago

https://simonwillison.net/2024/May/29/training-not-chatting/

𝔻𝔼𝕍𝕀𝕃𝕀𝕊ℍ@lemmy.world · 1 year ago

How anonymous is that thing ?
Ai needs data training & correction from us as user

nifty@lemmy.world · 1 year ago

You can train models of all kinds without disclosing anything personal about a user. Also see differential privacy

Autonomous User@lemmy.world · edit-2 1 year ago

I don’t see how we can prove this. Paying them to also spy on us is bad but allowing them replace our software c/localllama with their service is even worse. My funds are better spent on local AI development or device upgrade.