I’ve always believed that imitation is the sincerest form of flattery, so when I found a few articles from this blog copied elsewhere without attribution or links back to the originals, I was overjoyed. Someone — or apparently two people — thought that a total of four of my blog posts were worth plagiarizing.
When I looked a little more closely, though, I realized my mistake. My articles hadn’t been hand-picked with loving care by an author with writer’s block and exquisite taste; instead, they’d simply been snarfed up by software. One site, internetsatire.com, seems to just copy a random sampling of posts from the wordpress.com humor feed. I have no idea why a site with “satire” in its name would use the humor feed (which consists of all posts tagged “humor” in blogs hosted on wordpress.com) and not the equally convenient satire feed — but I think it would be satisfyingly ironic if this post were copied, so I’m adding both tags just to increase the odds.
The other site (really a family of sites: daoblogs.com and daoblog.co.cc) apparently gets articles from a variety of sources and then applies some kind of transformation to each one. It looks like they’re either translating them into some other language and back to English or using a thesaurus to replace individual words. Either way, the results can be kind of amusing; for example, this excerpt from my GFAJ-1 interview:
Unlikely Explanations: Thriving on arsenic the way you do is a major accomplishment. How did you do it?
GFAJ-1: It was a slow process that occurred over many generations. I won’t lie — initially, my family was as intolerant of arsenic as anyone else. But then arsenic started moving into the neighborhood, and we realized we’d have to adapt somehow.
turns into this on daoblogs:
Unlikely Explanations: Thriving without ceasing arsenic the way you do is a major accomplishment. How did you do it?
GFAJ-1: It was a inactive process that occurred over many generations. I won’t lie — initially, my family was in the same proportion that intolerant of arsenic as anyone otherwise. But then arsenic started moving into the vicinage, and we realized we’d accept to adapt somehow.
I’m not sure which version I like better. I’m also not sure why they bother modifying the text — they might be trying to change the wording to avoid showing up in searches for excerpts from the original, but the one thing they never seem to change is the title.
Apparently, the term for this kind of copying is blog scraping. I haven’t been plagiarized; I’ve just been scraped. It’s annoying, but not nearly as painful as it sounds.
In case anyone is curious — An Interview With GFAJ-1, the Arsenic-Eating Bacterium was copied to http://whatishot.daoblogs.com/9772.html, Better Living Through Candy: Creative and Practical Uses For Halloween Leftovers was copied to http://news.daoblogs.com/8799.html and http://daoblog.co.cc/wordpress/20101102-better-living-through-candy-creative-and-practical-uses-for-halloween-leftovers.html, A Brief History of the War on Groundhog Day was copied to http://www.internetsatire.com/humor/a-brief-history-of-the-war-on-groundhog-day/, and Sofa, Cat Toy, or Work of Art? The CatSofa is All Three was copied to http://www.internetsatire.com/humor/sofa-cat-toy-or-work-of-art-the-catsofa-is-all-three/.
And I don’t want to scare anyone off — I really do like seeing my posts quoted elsewhere, as long as there’s a link back to the original.
This business is rampant. I understand that, often, articles are automatically translated into another language and then back into English, to achieve the strange pigeon English style that is peculiar to scrape-farms, but something slightly different seems to be happening here.
It looks different to me, too — I think that’s because the words that are unchanged are completely unchanged — in the same order, without any extra articles inserted, etc.
I just dont get what the benefit to the scraper is. Do people really return to read that stuff enough for them to sell ads?
Maybe if I looked at it from a different vicinage…
Initially, I was in the same proportion that mystified as you are about that, but I think they have so much content (because they scrape so many sites) that they get lots of traffic from search engines, so they don’t really care whether anyone returns.
Oh Laura, I’m sorry to hear you have been scraped. I don’t think it has happened to me yet although I am seeing quite a lot of links to my blog from certain ‘enhancement’ sites, and I don’t really want to go and have a look… my blog isn’t of that sort. At least you’ve taught me a new word ‘vicinage’, and have highlighted not to use satire or humour as the tags to my post… funny as these stolen posts look, the originals are definitely the best!
Thanks, Tom — but please don’t throw the baby out with the bathwater! Humor and satire are perfectly useful tags, and I’m going to continue to use them.
Oh, and vicinage was a new word for me too, so this really has been a learning experience.
I’ve just noticed that humour and satire literally are missing from my blog… I’ll have to do something about that soon! :D
I do wonder if they’re going to copy this post too . . . and how that will come out!
I have been wondering from my site stats whether some Australian university student has been reading and copying my “book reviews” (I think they’re more “book thoughts”) for their assignments. If so, I’d really like to know what grade they got! :-)
You haven’t heard any complaints, right? So I’m sure they all got good grades.