[TUHS] Source code abundance?

Paul Ruizendaal pnr at planet.nl
Tue Mar 7 05:06:03 AEST 2017


My recollection of reading Groklaw way back when is that TSG had hired "a team of MIT deep divers" to do source code analysis, but I don't recall their methodology ever being disclosed.

I do recall some sort of sharding tool being proposed by the Linux' defenders (Eric Raymond?). From memory, the idea was to first parse the source into a normalized form to abstract away white space and variable naming. Then the tool would shard each normalized file into slices of 3 lines and compute a SHA[?] hash for each slice. Finally it would identify hash clashes and figure out if there were ranges of lines with clashing hashes. Human inspection could then identify whether the original lines involved looked like a case of copied source code. I do not recall that this tool was ever used or even built.



On 6 Mar 2017, at 17:19 , Jason Stevens wrote:

> I remember back in the original SCO vs the world days when SGi apparently dumped some 32v and was quickly dumped from Linux, there was some source auditing tool that they used?  All I remember is that it used the term 'shards' which of course is popular with the DB kids so it's hard to find... Now that there is an insane dump it'd be interesting to compare actual ancestry vs what we've always been told...
> 
> On March 6, 2017 11:33:18 PM GMT+08:00, Angelo Papenhoff <aap at papnet.eu> wrote:
> On 01/03/17, Arthur Krewat wrote:
>  Anyone ever looked at vetusware.com ?
>  
>  Google it with:
>  
>  site:vetusware.com unix source
>  
>  Is all of this stuff archived somewhere else?
>  
> 
> Since we had some discussion about AIX recently I thought I should ask:
> Many sources claim AIX is Sys V derived. Hoever, the source code (of 4.1.3)
> does not look like Sys V at all. Does anyone know the history?
> 
> aap
> 
> -- 
> Sent from my Android device with K-9 Mail. Please excuse my brevity.



More information about the TUHS mailing list