if you worry that your theories may hold up to be somewhat true - you can spoof your back-end scripts in such way that they don't look like their default copies by customizing url structure and outputted html code.
content patterns and html source structure patterns may be used to find duplicate content - but its another subject for theorization how deep (by which i mean to what extent) search to weed out duplicate content can go.
|