All,
Is there a good source of information about the Unix v6 filesystem
outside of the source code itself? Also, is there a source for the
history of the early Unix filesystems from v6 onward?
Thanks,
Will
> From: Brantley Coile
> But B's ++ and -- operators seem to be unique.
B seems to be like UNIX itself in this regard: a carefully selected set of
old ideas, along with a few new ones of equal value.
Noel
>> https://www.bell-labs.com/usr/dmr/www/kbman.html
>> https://www.bell-labs.com/usr/dmr/www/bintro.html
> Yup, there certainly were different versions of B.
Yes, kbman covers only one of the two implementations that
cohabited the PDP-11. The other was the same language, with
software paging, so it could have a larger data space.
Various aspects of the language were borrowed from PL/I,
BCPL and Algol 68. ++ and -- were novel operators. The
reversal of Algol's assignment operators (e.g. -=
became =-) was eventually repealed in C.
doug
So the PDP-7 code from Norman has a B interpreter. I know the history:
BCPL -> B -> NB -> C, but I don't recall seeing a decent description of
the B language. Does anybody know of such a document? We'll need something
like this so we can use the interpreter once we get it working :-)
Cheers, Warren
I grabbed a copy as well. A quick grep showed something I had forgotten:
I ran a Source redistribution service.
David
>
> On Tue, 2 Feb 2016, Warren Toomey wrote:
>
>> This is temporarily at http://www.tuhs.org/Usenet/Usenet.tar.bz2 if
>> anybody else would like to grab it.
>
> Suitably grabbed :-) I know, I must finish my mirror some day (got a few
> health issues right now)...
>
At www.skeeve.com/Usenet.tar.bz2 is a copy of UUNET's archives of the
various USENET source newsgroups. I created this file on September 2, 2004.
I made it for myself, since it was clear that uu.net would disappear
sometime soon...
It's 139 Meg - Warren, maybe you can put it into the archives and everyone
else can get it from there? I think the person who hosts www.skeeve.com
has some monthly limits on data transfer and I don't want him blown
out of the water.
Thanks,
Arnold
> I'm still trying to get my head around how a program such as "egrep"
> which handles complex patterns can be faster than one that doesn't.
> Is there a simple explanation, involving small words?
First, the big-word explanation. Grep is implemented as a nondeterministic
finite automaton, egrep as a deterministic finite automaton. Theory folk
abbreviate these terms to "NFA" and "DFA", but these are probably not
what you had in mind as "small words".
The way grep works is to keep a record of possible partial parsings
as it scans the subject text; it doesn't know which (if any) will
ultimately be the right one, hence "nondeterministic". For example,
suppose grep seeks pattern '^a.*bbbc' in text 'abbbbbbc'. When
it has read past 3 b's, the possible partial parses are 'a.*', 'a.*b',
'a.*bb' and 'a.*bbb'. If it then reads another b, it splits the
first partial parse into 'a.*' and 'a.*b', updates the next two
to 'a.*bb' and 'a.*bbb', and kills off the fourth. If instead it
reads a c, recognition is successful; on anything else the three
b-committed partials are killed, and only 'a.*' survives to scan on.
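To make the bookkeeping concrete, here is a minimal C sketch of the
nondeterministic scan (an illustration of the idea, not grep's actual
code), for a toy pattern language of literals, '.', and postfix '*',
anchored at the start of the text. A "partial parse" is just a
position in the pattern, and the scanner carries the whole set of
live positions across the text:

    /* Simulate the NFA by carrying the set of live pattern
       positions ("partial parses") across the text. */
    #include <stdio.h>
    #include <string.h>

    #define MAXPAT 128

    static const char *pat;
    static int m;                   /* strlen(pat) */

    /* Add position p, plus whatever is reachable by taking
       zero occurrences of starred atoms. */
    static void closure(int p, char *set)
    {
        if (p > m || set[p])
            return;
        set[p] = 1;
        if (p + 1 < m && pat[p + 1] == '*')
            closure(p + 2, set);
    }

    /* Return 1 if pat matches a prefix of text, as '^pat' would. */
    int nfa_match(const char *pattern, const char *text)
    {
        char cur[MAXPAT + 1] = {0}, nxt[MAXPAT + 1];
        int p;

        pat = pattern;
        m = (int)strlen(pat);
        closure(0, cur);
        for (;; text++) {
            if (cur[m])
                return 1;           /* some parse consumed all of pat */
            if (*text == '\0')
                return 0;           /* every partial parse died */
            memset(nxt, 0, sizeof nxt);
            for (p = 0; p < m; p++) {
                if (!cur[p] || pat[p] == '*')
                    continue;
                if (pat[p] == '.' || pat[p] == *text) {
                    if (p + 1 < m && pat[p + 1] == '*')
                        closure(p, nxt);    /* starred: may match again */
                    else
                        closure(p + 1, nxt);
                }
            }
            memcpy(cur, nxt, sizeof cur);
        }
    }

    int main(void)
    {
        printf("%d\n", nfa_match("a.*bbbc", "abbbbbbc"));   /* 1 */
        printf("%d\n", nfa_match("a.*bbbc", "abbxc"));      /* 0 */
        return 0;
    }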
Egrep, by preprocessing the expression, produces separate code for
each of several possible states: "no b's", "1 b", "2 b's" and "3 b's".
When it's in state "1 b", for example, it switches on the next
character into "2 b's" or fails, depending on whether the
character is b or not--very fast. Grep, on the other hand, has to
update all the live parses.
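The same pattern, compiled the deterministic way, can be hand-sketched
like this (an illustration of the idea, not anything egrep actually
emits); each state is straight-line code that just switches on the
next character:

    /* A hand-compiled DFA for '^a.*bbbc', with Doug's state names. */
    #include <stdio.h>

    enum state { START, B0, B1, B2, B3, YES, DEAD };

    int dfa_match(const char *text)
    {
        enum state s = START;

        for (; *text && s != YES && s != DEAD; text++) {
            switch (s) {
            case START:             /* must open with 'a' */
                s = (*text == 'a') ? B0 : DEAD;
                break;
            case B0:                /* "no b's" */
            case B1:                /* "1 b"    */
            case B2:                /* "2 b's"  */
                s = (*text == 'b') ? s + 1 : B0;
                break;              /* '.*' absorbs anything else */
            case B3:                /* "3 b's": a c completes bbbc */
                if (*text == 'c')
                    s = YES;
                else if (*text != 'b')
                    s = B0;         /* extra b's stay at "3 b's" */
                break;
            default:
                break;
            }
        }
        return s == YES;
    }

    int main(void)
    {
        printf("%d\n", dfa_match("abbbbbbc"));  /* 1 */
        printf("%d\n", dfa_match("abbc"));      /* 0 */
        return 0;
    }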
So if egrep is so fast, why do we have grep? One reason is that
grep only needs to keep a list of possible progress points
in the expression. This list can't be longer than the expression.
In egrep, though, each state is a subset of progress points.
The number of possible subsets is exponential in the length
of the expression, so the recognition machine that egrep constructs
before attempting the parse can explode--perhaps overflowing
memory, or perhaps taking so long in construction that grep
can finish its slow parse before egrep even begins its fast parse.
To revert to the words of theory folks, grep takes time O(m*n) and
space O(m) worst case, while egrep takes time and space O(2^m+n).
(2^m is an overestimate, but you get the idea.)
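For a concrete bad case (the textbook one, not the job from Al's
story below): the pattern '(a|b)*a(a|b)(a|b)...(a|b)', with n-1
trailing '(a|b)' atoms, matches strings whose nth character from the
end is an a. A DFA has to remember which of the last n characters
were a's, so it needs on the order of 2^n states; grep's list of
progress points never grows past the pattern's roughly 3n positions.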
That's the short story. In real life egrep overcomes the exponential
by lazily constructing the machine--not generating a state until it
is encountered in the parse, so no more than n states get constructed.
It's a complex program, though, for the already fancy preprocessing
must be interleaved with the parsing.
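The lazy trick can be grafted onto the toy NFA sketch above: treat
each set of pattern positions as a bitmask, give each distinct set a
DFA state number the first time it turns up, and compute a transition
only when some input character first demands it. (Again a sketch
under toy limits, patterns shorter than 32 characters and at most 64
discovered states, and not Al's actual code.)

    /* Lazy-DFA sketch: DFA states are bitmasks of NFA positions,
       created, and their moves cached, only as input reaches them. */
    #include <assert.h>
    #include <stdio.h>
    #include <string.h>

    #define MAXSTATES 64            /* toy bound; no cache eviction */

    static const char *pat;
    static int m;                   /* strlen(pat), assumed < 32 */

    static unsigned closure(int p, unsigned set)
    {
        if (p > m || (set & (1u << p)))
            return set;
        set |= 1u << p;
        if (p + 1 < m && pat[p + 1] == '*')
            set = closure(p + 2, set);
        return set;
    }

    /* One NFA step from position-set cur on character c. */
    static unsigned step(unsigned cur, int c)
    {
        unsigned nxt = 0;
        int p;

        for (p = 0; p < m; p++) {
            if (!(cur & (1u << p)) || pat[p] == '*')
                continue;
            if (pat[p] == '.' || pat[p] == c)
                nxt = (p + 1 < m && pat[p + 1] == '*')
                    ? closure(p, nxt) : closure(p + 1, nxt);
        }
        return nxt;
    }

    static unsigned dset[MAXSTATES];    /* NFA set behind each DFA state */
    static int dnext[MAXSTATES][256];   /* cached moves; -1 = unbuilt */
    static int nstates;

    static int intern(unsigned set)     /* find or create a DFA state */
    {
        int i;

        for (i = 0; i < nstates; i++)
            if (dset[i] == set)
                return i;
        assert(nstates < MAXSTATES);    /* sketch: just give up here */
        dset[nstates] = set;
        memset(dnext[nstates], -1, sizeof dnext[nstates]);
        return nstates++;
    }

    int lazy_match(const char *pattern, const char *text)
    {
        int s, c;

        pat = pattern;
        m = (int)strlen(pat);
        nstates = 0;
        s = intern(closure(0, 0));
        for (;; text++) {
            if (dset[s] & (1u << m))
                return 1;
            if (*text == '\0')
                return 0;
            c = (unsigned char)*text;
            if (dnext[s][c] < 0)        /* first visit: build the move */
                dnext[s][c] = intern(step(dset[s], c));
            s = dnext[s][c];            /* thereafter, a table lookup */
        }
    }

    int main(void)
    {
        printf("%d\n", lazy_match("a.*bbbc", "abbbbbbc"));  /* 1 */
        printf("%d\n", lazy_match("a.*bbbc", "abbc"));      /* 0 */
        return 0;
    }

Only as many states as the input actually visits get built, which is
the point: the exponential worst case is paid for only if the text
drags the machine into it.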
Egrep is a tour de force of classical computer science, and pays
off big on scanning big files. Still, there's a lot to say for
the simple elegance of grep (and the theoretical simplicity
of nondeterministic automata). On small jobs it can often win.
And it is guaranteed to work in O(m) memory, while egrep may need
O(n).
-------------------------------------------------
Ken Thompson invented the grep algorithm and published it in CACM.
A pointy-headed reviewer for Computing Reviews scoffed: why would
anybody want to use the method when a DFA could do the recognition
in time O(n)? Of course the reviewer overlooked the potentially
exponential costs of constructing a DFA.
Some years later Al Aho wrote the more complicated egrep in the
expectation that bad exponential cases wouldn't happen in everyday life.
But one did. This job took 30 seconds' preprocessing to prepare
for a fraction of a second's parsing. Chagrined, Al conceived
the lazy-evaluation trick to overcome the exponential and
achieved O(n) run time, albeit with a big linear constant.
In regard to the "short history of grep", I have always thought
my request that Ken liberate regular expressions from ed caused
him to write grep. I asked him one afternoon, but I can't remember
whether I asked in person or by email. Anyway, next morning I
got an email message directing me to grep. If Ken took it
from his hip pocket, I was unaware. I'll have to ask him.
Doug