svoboda on Nostr:
So I saw a post on Twatter where someone showed DeepSeek was referencing Baidu. Then I did some cursory searching, and it appears this is not a startup at all, but is built upon Baidu's Ernie, which came out in 2023.
Moreover, Baidu is essentially the Chinese Google, and it aggressively scrapes websites even if you try to block it (via robots.txt or firewall IP blocking), so I am now FULLY calling bullshit on their claims. If they've leveraged their search datasets, I can see how they wouldn't need as many GPUs, because they've got decades' worth of intrusive-ass indexing and whatever they've done with that data over time.