How does AI follow ethical guidelines in data collection?
r/artificial
Hot take: if I want to gather data from the internet and I'm writing scripts to speed up the process, I have to follow some basic rules (i.e. look at the sitemap, find the relevant robots.txt, and follow that website's preferences and rules). But it seems like every AI agent I've used does not give af about rules and limits, and is totally cool building me a scraper that will fire off hundreds of thousands of requests without any regard for the website owner's preferences.
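For reference, here's a rough sketch of what "following the rules" could look like in Python, using only the standard library's `urllib.robotparser`. The user agent name, the default delay, and the sample robots.txt are all made up for illustration. It parses the text offline so it runs without hitting any site; a real scraper would fetch the site's actual robots.txt first.

```python
import urllib.robotparser

# Hypothetical values, chosen only for this sketch.
USER_AGENT = "example-research-bot"
DEFAULT_DELAY = 1.0  # seconds to wait when the site states no Crawl-delay

# Made-up robots.txt content standing in for a real site's file.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
Crawl-delay: 5
Sitemap: https://example.com/sitemap.xml
"""


def load_rules(robots_txt: str) -> urllib.robotparser.RobotFileParser:
    """Parse robots.txt text into a rule set we can query."""
    parser = urllib.robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def allowed_urls(parser, urls, user_agent=USER_AGENT):
    """Keep only the URLs the site allows this user agent to fetch."""
    return [url for url in urls if parser.can_fetch(user_agent, url)]


def polite_delay(parser, user_agent=USER_AGENT) -> float:
    """Use the site's Crawl-delay if it gives one, else our own default."""
    delay = parser.crawl_delay(user_agent)
    return float(delay) if delay is not None else DEFAULT_DELAY


if __name__ == "__main__":
    rules = load_rules(SAMPLE_ROBOTS)
    candidates = [
        "https://example.com/public/page",
        "https://example.com/private/data",
    ]
    print("allowed:", allowed_urls(rules, candidates))
    print("seconds between requests:", polite_delay(rules))
    # site_maps() needs Python 3.8+; it returns None if no Sitemap line exists.
    print("sitemaps:", rules.site_maps())
```

In a real scraper you'd call `time.sleep(polite_delay(rules))` between requests and only fetch what `allowed_urls` returns. It's maybe twenty lines of actual logic, which is kind of the point: it's not hard for an agent to include by default.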