it is open-source, if they did something like this, we would know it for sure
Privacy
A place to discuss privacy and freedom in the digital world.
Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.
In this community everyone is welcome to post links and discuss topics related to privacy.
Some Rules
- Posting a link to a website containing tracking isn't great, if contents of the website are behind a paywall maybe copy them into the post
- Don't promote proprietary software
- Try to keep things on topic
- If you have a question, please try searching for previous discussions, maybe it has already been answered
- Reposts are fine, but should have at least a couple of weeks in between so that the post can reach a new audience
- Be nice :)
Related communities
much thanks to @gary_host_laptop for the logo design :)
DeepSeek does the same things that OpenAI does, but it's a foreign actor so OOooooOOWwwwooOOOO sCaRrRey!
Wait until they hear what data Instagram/Meta collects during use!
But they're a US company so it's ok.
Realistically what is the worst thing China is doing with your private data? Selling it? If you’re not a Chinese National, at least you don’t fall under their jurisdiction.
If you’re a U.S. citizen, with all the tech oligarchs cozying up to the current administration, I’d be a lot more concerned with Facebook/Twitter/Etc collecting your data.
This "China's AI is taking your data and that's bad" is shockingly similar to "TikTok is taking your data and that's bad". Lots of US counterparts do the same thing, but I don't see (as much) media coverage about that.
Don Draper: "no no no, everyone else's cigarettes are dangerous. Lucky Strikes are... toasted."
The way I think of it is, I don't live in China, so regardless of my objections to their values or human rights abuses, why would CCP or an affiliated company care about me or ruin my life on the basis of or by abusing my data? A big part of why I care about privacy is I don't want to be filtering my every thought through consideration of whether the powers that be would approve, and US companies are way more relevant to that.
This is probably only a problem with the online version. In contrast to google and openAI they, like meta, let you download the model and run it offline, where they can't access any of this data I presume.
I've been running it locally using ollama, works completely offline, no keystroke data for anyone!
Anyone using DeepSeek as a service the same way proprietary LLMs like ChatGPT are used is missing the point. The game-changer isn’t that a Chinese company like DeepSeek can compete with OpenAI and its ilk—it’s that, thanks to DeepSeek, any organization with a few million dollars to train and host their own model can now compete with OpenAI.
They all do this...
Don't use hosted models unless you pay for your own server space and it is encrypted.
Don't be a fucking idiot.
No.
As opposed to Microsoft, Google, .. NSA, or GCHQ servers. Or all of the above.
"We store the information we collect in secure servers located in the People's Republic of China"
Now you Americans know how we Europeans feel when Google, Amazon and Facebook store our information on American servers. Hint: The protective wall between Chinese servers and their government are about as good as the one between American servers and their government - at least for non-US citizens. The last thin veil of privacy for Eurpeans has been ripped to shreds by Trump last week.
Did the American technology giants think they had the monopoly on capturing human input too?
I'm confused. Isn't "collecting keystroke data" just an alarmist way to describe text entry?
This is the full paragraph:
We collect certain device and network connection information when you access the Service. This information includes your device model, operating system, keystroke patterns or rhythms, IP address, and system language. We also collect service-related, diagnostic, and performance information, including crash reports and performance logs. We automatically assign you a device ID and user ID. Where you log-in from multiple devices, we use information such as your device ID and user ID to identify your activity across devices to give you a seamless log-in experience and for security purposes.
It looks to me that they are using it to identify the user uniquely, maybe also related to captcha to prevent bots (it's common practice to capture mouse and keyboard while resolving captchas to see if the movement is human-like).
Looks like there are more things I need to start randomizing and injecting with noise.
If you think the American companies do anything different you're not paying attention and simply believing the propaganda.
Chinese company does what American companies have done for 25+ years now!
Is it time for REAL data privacy laws or are we just gonna keep playing whack-a-mole with Chinese tech companies that get us nowhere?
This article is what US propaganda looks like folks. Mashable should be ashamed.
Literally all AI companies do this to run their services. Except you can actually download Deepseek and run it completely securely on your own devices. You know who doesn't allow that security? OpenAI and the other US companies currently being screwed.
Same as Chrome's magic bar, or android keyboard no ? So in the end, does USA doing it good because "democracy" (never ever with napalm) when China is bad because human rights violation (USA never did anything like this) ?
Nope, At least we can check DeepSeek's source code
Unlike OpenAI..... oops I meant ClosedAI
Not excusing Chinese companies but everyone does the same shit. I bet a lot of US companies that behave the same or worse will be looking for trade barriers to protect their business so their interests will be stoking fear of Chinese competitors. I don't really give a shit which country is doing it, I am not buying what they are selling.
US companies have a stranglehold on government, education and business and are getting access to my families data despite my personal objections. Far more concerned about that than a Chinese service I have no intention of using.
Deepseek can at least be self hosted if you want AI in your life. I can happily live without it.
the company states that it may share user information to "comply with applicable law, legal process, or government requests.
Literally every company's privacy policy here in the US basically just says that too.
Not only does DeepSeek collect "text or audio input, prompt, uploaded files, feedback, chat history, or other content that [the user] provide[s] to our model and Services," but it also collects information from your device, including "device model, operating system, keystroke patterns or rhythms, IP address, and system language."
Breaking news, company with chatbot you send messages to uses and stores the messages you send, and also does what practically every other app does for demographic statistics gathering and optimizations.
Companies with AI models like Google, Meta, and OpenAI collect similar troves of information, but their privacy policies do not mention collecting keystrokes. There's also the added issue that DeepSeek sends your user data straight to Chinese servers.
They didn't use the word keystrokes, therefore they don't collect them? Of course they collect keystrokes, how else would you type anything into these apps?
In DeepSeek's privacy policy, there's no mention of the security of its servers. There's nothing about whether data is encrypted, either stored or in transmission, and zero information about safeguards to prevent unauthorized access.
This is the only thing that seems disturbing to me, compared to what we'd like to expect based on the context of what DeepSeek is. Of course, this was proven recently in practice to be terrible policy, so I assume they might shore up their defenses a bit.
All the articles that talk about this as if it's some big revelation just boil down to "company does exactly what every other big tech company does in America, except in China"
Your devices keyboard app has been collecting all of your keystrokes.
HuggingChat is open source and lets you use DeepSeek. It also doesn't censor results like the main app (allegedly) does.
haha, now do openai
I swear people do not understand how the internet works.
Anything you use on a remote server is going to be seen to some degree. They may or may not keep track of you, but you can't be surprised if they are. If you run the model locally, there is no indication it is sending anything anywhere. It runs using the same open source LLM tools that run all the other models you can run locally.
This is very much like someone doing surprised pikachu when they find out that facebook saves all the photos they upload to facebook or that gmail can read your email.
other ai services do too. u might not realize it.
Yeah, uh... If you think that American companies aren't doing this same thing and handing your data over to the government without a warrant among other bad uses, I have some bad news for you. This is pretty much par for the course, and I'm pretty sure that we're witnessing a well financed negative media blitz happening to try and keep OpenAI from getting all of its spaghetti spilled. Watch for the government to try and ban deepseek for "national security" reasons soon.
Any ChatAI logs your keystrokes and your inputs to work and update their LLM. The PP and TOS is the same and even better as those from the US competitors. DeepSeek is OpenSource
Anyway I prefer Andisearch and its PP, the best of all these big tech AIs.
Chinese company uses servers located in China. More news at 11.
detective conan sure had a hard time cracking the case!
"The personal information we collect from you may be stored on a server located outside of the country where you live. We store the information we collect in secure servers located in the People's Republic of China," the privacy policy reads.
Oh the horror! Let's look at what our glorious spawns-of-techbro heroism has for us in store:
spoiler
OpenAI processes your Personal Data for the purposes described in this Privacy Policy on servers located in various jurisdictions, including processing and storing your Personal Data in our facilities and servers in the United States. While data protection law varies by country, we apply the protections described in this policy to your Personal Data regardless of where it is processed, and only transfer that data pursuant to legally valid transfer mechanisms.
spoiler
When you access our website or Services, your personal data may be transferred to our servers in the US, or to other countries outside the European Economic Area (“EEA”) and the UK. This may be a direct provision of your personal data to us, or a transfer that we or a third party make.
So not only is your data "possibly" stored in one country, now there's a possibility of it being stored in many different countries. Where's the outcry for that?
Ok, so maybe your data being under the jurisdiction of another country is sus, right?
In another section about how DeepSeek shares user data, the company states that it may share user information to "comply with applicable law, legal process, or government requests."
OH MY GOD SOUND THE ALARM!
ChatGPT:
spoiler
We may use Personal Data for the following purposes: [...] To comply with legal obligations and to protect the rights, privacy, safety, or property of our users, OpenAI, or third parties.
Claude:
spoiler
Pursuant to regulatory or legal requirements, safety, rights of others, and to enforce our rights or our terms. We may disclose personal data to governmental regulatory authorities as required by law, including for legal, tax or accounting purposes, in response to their requests for such information or to assist in investigations. We may also disclose personal data to third parties in connection with claims, disputes or litigation, when otherwise permitted or required by law, or if we determine its disclosure is necessary to protect the health and safety of you or any other person, to protect against fraud or credit risk, to enforce our legal rights or the legal rights of others, to enforce contractual commitments that you have made, or as otherwise permitted or required by applicable law.
So not only can your data be subject to the authorities, but it's also handed out to 3rd parties (mind you, DeepSeek does the exact same, so why is it any surprise?).
Not only does DeepSeek collect "text or audio input, prompt, uploaded files, feedback, chat history, or other content that [the user] provide[s] to our model and Services," ...
🤦... You get the idea now, bother yourself with the privacy policies of the respective contemporaries and CTRL + F to "User Content" or "User Input".. Same fucking shit.
Companies with AI models like Google, Meta, and OpenAI collect similar troves of information, but their privacy policies do not mention collecting keystrokes.
Yes, collecting keystrokes is probably the oddest thing here. To compare data farming giants with a decade and a half's worth of data collection to a startup in terms of data collection is so astronomically dumb.
I could go on but I'm bored now. Do your own research.