I'm not a lawyer, but Google is doing the same and many other companies too. They never get sued, Google even caches your entire site without asking explicitly. The archive.org Project even copies to whole internet and luckily get's away with it.
Other than that, I'm avid when it comes to Ontologic, Semantic, Neurologic or Stochastic systems and would like to tell you that simply using Semantics wil not lead to a working solution.