Davriellelouna@lemmy.world to Technology@lemmy.world · English · 2 days ago (edited)
The AI company Perplexity is complaining their bots can't bypass Cloudflare's firewall
www.searchenginejournal.com · 215 comments
threeganzi@sh.itjust.works · 15 hours ago
Does it not need to be scraped to be indexed, assuming it’s semi-typical RAG stuff?
Electricd@lemmybefree.net · 14 hours ago
I assume their script does some search engine stuff, like querying Google or Bing, and then “scrapes” the links it visits. Some Selenium stuff.
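To make the guess above concrete, here is a rough sketch of the "pull links out of a results page, then pull text out of each page" step. It is only an illustration of the general idea, not how Perplexity actually works. The names `extract_links` and `extract_text` are made up for this example, the fetching itself (plain HTTP or Selenium) is left out, and a real crawler would be expected to respect robots.txt.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkExtractor(HTMLParser):
    """Collects unique absolute http(s) links from <a href=...> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Resolve relative links against the page they were found on.
        absolute = urljoin(self.base_url, href)
        if urlparse(absolute).scheme in ("http", "https") and absolute not in self.links:
            self.links.append(absolute)


class TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def extract_links(html, base_url):
    """Hypothetical helper: links found on a (search results) page."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


def extract_text(html):
    """Hypothetical helper: plain text that could later be indexed."""
    parser = TextExtractor()
    parser.feed(html)
    return " ".join(parser.chunks)
```

In a RAG-style setup, the text returned by `extract_text` would be what gets chunked and indexed, which is why the page has to be fetched at some point either way.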