I just found http://programmingfaq.w3ec.com/faq/4761/whats-the-hi-lo-algorithm which has an exact copy of this Stack Overflow question,
What's the Hi/Lo Algorithm
[1], with all its answers and no difference in a single character. There is no reference to Stack Overflow.
Is this legal, or at least tolerated? Is this impertinent theft of information? I just can't see any sense in copying whole pages, particularly when not referencing the source.
ACCEPTED]
Stack Overflow is licenced under Attribution-Share Alike 3.0 Generic [1], which states:
Attribution — You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).
I think it is pretty clear that they have failed to do this (or any attribution whatsoever). So IMO (and IANAL) no: this usage is not legitimate. But within the terms of the cc-wiki agreement cited re-use is fine.
Edit: the cc-wiki licensing and attribution policy [2] are also linked on every footer page like so.

If you click through to the attribution policy [3] you will find the specifics:
[1] http://creativecommons.org/licenses/by-sa/3.0/So let me clarify what we mean by attribution. If you republish this content, we require that you:
- Visually indicate that the content is from Stack Overflow, Meta Stack Overflow, Server Fault, or Super User in some way. It doesn’t have to be obnoxious; a discreet text blurb is fine.
- Hyperlink directly to the original question on the source site (e.g., http://stackoverflow.com/questions/12345)
- Show the author names for every question and answer
- Hyperlink each author name directly back to their user profile page on the source site (e.g., http://stackoverflow.com/users/12345/username)
By “directly”, I mean each hyperlink must point directly to our domain in standard HTML visible even with JavaScript disabled, and not use a TinyURL [4] URL or any other form of obfuscation or redirection. Furthermore, the links must not be nofollowed [5].
There are a number of "illegal" clones popping up that use the cc-wiki data without following the creative commons attribution terms (these are linked at the bottom of every webpage [2], and also included as a .txt file in every data dump we produce).
Note: Google now has a form to report scrapers [3] when they appear above the original in search results.
Also, you can block unwanted sites from appearing in Google search from the search page itself [4]:
The option to block a site appears when you click a search result and then navigate back to the search results page. Click the "Block" link next to that result to block all pages within the site's entire domain.
The following is a list of all the illegal clones found so far. This post is a wiki; please edit in any new clones you find that are not already listed here!
http://johnnycoder.com/blog/2008/11/10/run-cmdexe-as-local-system-account/)http://www.shrenikvikam.com/adb-doesnt-detect-android-device-vodafone-845/)http://drija.com/multiple-monitors/54867/how-to-move-an-applications-dialog-box-from-a-disconnected-monitor-to-the-main/)http://www.questionhub.com/StackOverflow/1341075)http://www.genmaint.com/kill-ajax-requests-using-javascript-using-jquery.html)http://fr.w3support.net/index.php?db=su&id=9811)http://www.nujk.com/timer-job-from-a-spweb) (
original
[6])http://www.ency9.com/gadget/indexing-for-google-scholar-which-tags-to-use/) (
original
[7])http://need-programmer.blogspot.com/2010/09/how-to-use-subqueries-in-sqlalchemy-to.html) (
original
[8])http://news.puppin.org/question/non-relational-db-which-one-is-the-right-choice-closed/) (
original
[9])http://www.softwaretalk.info/how-can-i-start-from-page-1-again-on-the-4th-page-of-my-ms-word-document.htm) (
original
[10])http://asp.net.bigresource.com/MVC-3-Razor-Syntax-partial-view-Menu-cshtml-with-full-markup-a9sAFbss8.html) (
original
[11])nofollow in footer links to Stack Exchange sites (but regular links to the site otherwise)www.justlogged.com/Question/3/2786/fb3f76858cb38e5b7fd113e0bc1c0721http://codinganswers.info/index.php/2010/11/stackapplet-stackoverflow-meets-the-gnome-desktop-v1-4-released/http://www.texmach-intl.com/stackapplet-bringing-stack-exchange-notifications-to-your-desktop-1-5-beta-2-released/http://vniup.com/index.php/category/computer-science/)http://b.vniup.com/index.php/category/english-learning)http://b.vniup.com/index.php/category/electronic)http://b.vniup.com/index.php/category/gamer)http://b.vniup.com/index.php/category/geographic-information-system)http://b.vniup.com/index.php/category/house-improvement)http://b.vniup.com/index.php/category/physics)http://b.vniup.com/index.php/category/sharepoint)http://b.vniup.com/index.php/category/text-and-document)http://vniup.com/index.php/category/ubuntu)http://it.6-da.com/show/6939570.aspx) (
original
[26])http://www.x2x1.com/show/6766034.aspx http://stackoverflow.com/questions/6766034/cancel-block-in-uiview-animatewithdurationhttp://www.qandasystem.info/security/securing-an-area-both-physically-and-technically/http://java.resourcezen.com/how-to-copy-a-function-or-class-to-another-file-1http://codeblow.com/questions/good-ruby-on-rails-free-hosting duplicates http://stackoverflow.com/questions/1055682/good-ruby-on-rails-free-hosting.http://www.dkphp.com/questions-2/pros-and-cons-of-using-a-cursor-in-sql-server.htmlhttp://www.dkphp.com/questions-2/incorrect-date-comparison-results-in-sql-server-2008-r2.htmlhttp://www.rqna.net/qna/iiymmi-how-to-have-a-type-in-closure-compiler-externs-without-a-constructor.html)Note that if you find a system that does appear to be following the terms of the CC license and providing sufficient attribution, you may still want to report it on this MSO question [30] if it is appearing ahead of the original SE site in Google search results.
[1] http://meta.stackoverflow.com/q/131846/131713I do believe the problem is with the following paragraph:
Attribution — You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).
(emphasis is mine)
What's this manner specified? I think this is too subjective. How can you just point at people and say they are ilegal using your content if you didn't specified what exactly this "attribution" means?
The only complete reference about this subject that I found is within an official blog post [1] (it even has the website you're concerned about as an example).
I do believe it would made no harm a simple url below the cc-wiki image in SO footer, named "Attribution Guidelines" that are contained in this post [2]. Doing this way people have no excuse of "misunderstanding" attribution guidelines since you explicity said what you need to to when using SO content.
[1] http://blog.stackoverflow.com/2009/06/attribution-required/I am not a lawyer...but to me the site clearly violates the terms of use:
The last two are serious violations of the spirit and letter, and I think that persecution would be warranted.
Unfortunately, the registrant [1] is in China, but godaddy.com will probably cut them off at the knees if @[Jeff Atwood] requests it.
[1] http://who.godaddy.com/WhoIs.aspx?domain=w3ec.com&prog_id=godaddyThis seems a bit worrying:
Searching Google for my one and only (woohoo!) Stackoverflow question using unambiguous terms returns the top result as a tuts9.com copy of the SO page. Also clicking this result link gives me a 403/forbidden page from tuts9.com (Google cache page confirms this page is my SO question even though they filed it as a VB/VB.NET question!).
My original stackoverflow question is here:
http://stackoverflow.com/questions/3251191/how-to-prevent-non-repeatable-query-results-using-persistence-api-in-java-se
Google.com results for "how to prevent non repeatable query results using persistence api in java se" :
http://www.google.co.uk/search?hl=en&q=%22how+to+prevent+non+repeatable+query+results+using+persistence+api+in+java+se%22
[1]
The SO page is nowhere to be seen instead the top result from Google is:
http://tuts9.com/questions/25608/how-to-prevent-non-repeatable-query-results-using-persistence-api-in-java-se
Looks like Google are eliminating the SO page from the results as if it were spam, instead giving the tuts9.com copy the 'top slot'. Are Google being gamed here? I've sent a feedback message to Google raising this issue (through their web form though so I don't know how much attention it will get). Maybe someone from SO contacting Google directly would have more effect.
Update (2010/09/02): Currently the top 2 Google results from the search above are for tuts9.com followed by this question on meta! The original SO question is omitted for being too similar (too similar to tuts9.com copies? lol). When similar results are included the SO question appears in page 2 of results.
[1] http://www.google.co.uk/search?hl=en&q=%22how+to+prevent+non+repeatable+query+results+using+persistence+api+in+java+se%22I just found out about this thread while googling our site...
From what I understand from CC-by-SA [1] we comply with everything the license permits.
We ALWAYS provide a link back to the original content and a link to the source site's homepage. We always state that we don't own any of the content and state our sources. We just want to help people find good articles. We are a meta-search engine.
We never thought that we would have that much traffic. In the next few weeks, we will reduce the number of articles coming from Stack Overflow and Server Fault and will have parterships with good publishers in order to have great blog articles.
Things move so fast on the internet that we have gotten caught in a big spiral. That's why we need ads in order to have access to better servers and invest time into the site to make it better.
Our goal is to be a good meta-search engine and blog mashup. We need good articles, so do not hesitate to contact us if you want your site included.
[1] http://creativecommons.org/licenses/by-sa/2.5/answermoz.com is copying questions without attribution. The copy of a question at answermoz.com appears in Google searches ahead of results from money.SE (try searching on the question titles below.)
Example 1
http://money.stackexchange.com/questions/3461/what-differentiates-index-funds-and-etfs
http://answermoz.com/what-differentiates-index-funds-and-etfs/
Example 2
http://money.stackexchange.com/questions/3118/is-my-credit-score-of-766-lower-than-it-should-be
http://answermoz.com/is-my-credit-score-of-766-lower-than-it-should-be/
Hi folks. I found this answer today:
http://www.go4answers.com/Example/non-nullable-columns-db-becomes-101286.aspx
It looks very much like a Stack Overflow question, but it could be it has nothing to do with Stack Overflow. I thought I might just pop something quickly, to-be-sure (/said in an Irish way, with no offence to our Irish friends).
It seems google has created this tool to address and combat just this problem: Personal blocklist [1]
The personal blocklist extension will transmit to Google the patterns that you choose to block. When you choose to block or unblock a pattern, the extension will also transmit to Google the URL of the web page on which the blocked or unblocked search results are displayed. You agree that Google may freely use this information to improve our products and services.
I guess if you want to combat this thing, the easy thing to do is to use this extension. Many people blocking a site is sure to get their attention, thereby removing it from search results, thereby solving most of the problem.
[1] https://chrome.google.com/webstore/detail/nolijncfnkgaikbjbdaogikpmpbdcdefWhat about stackmobile? They say they are not affiliated with stackoverflow, so it's clearly not an official mobile version of SO.
They scrape all SO sites, including all data about users, badges, etc., yet they don't provide direct link to questions nor do they provide direct links to profiles on SO sites.
They also don't mention creative commons license anywhere on their site.
Example: http://stackmobile.com/view_question.php?site=stackoverflow&id=4587642 [1]
Is it OK for them to scrape SO sites like that?
[1] http://stackmobile.com/view_question.php?site=stackoverflow&id=4587642