Rfc | 0420 |
Title | CCA ICCC weather demo |
Author | H. Murray |
Date | January 1973 |
Format: | TXT, HTML |
Status: | UNKNOWN |
|
Network Working Group H. Murray
Request for Comments: 420 Computer Corporation of America
NIC 12764 04 January 1973
CCA ICCC WEATHER DEMO
THE WEATHER DEMO ORIGINALLY DEVELOPED FOR THE ICCC SHOW IS NOW
GENERALLY AVAILABLE. IT IS A VERY NICE NETWORK DEMO SINCE IT REALLY
CAN/DOES RUN ON TWO MACHINES.
THIS DEMO IS TWO INDEPENDENT PROGRAMS. TOGETHER THEY TRY TO ANSWER
ENGLISH QUESTIONS ABOUT THE WEATHER. FOR EXAMPLE, 'DID IT RAIN IN
BOSTON ON OCTOBER 7?'. THE NATURAL LANGUAGE PROGRAM IS WRITTEN IN
LISP. IT IS A MODIFICATION OF A SYSTEM DEVELOPED BY T. WINOGRAD AT
MIT. THE OTHER HALF IS A PRIMITIVE DATACOMPUTER. THE DATACOMPUTER
PROVIDES NETWORK DATAMANAGEMENT FACILITIES AND WILL INCORPORATE THE
TRILLION-BIT UNICON CURRENTLY BEING INSTALLED AT AMES.
THE TWO PROGRAMS COMMUNICATE VIA DATALANGUAGE USING THE NET.
'WEATHER' NOW USES HOST '37' AS THE LOCATION OF THE DATACOMPUTER.
THIS FIXES THE OLD 'CCA' NOT IN THE HOST TABLES BUG.
KEEP IN MIND THAT IS AN EVOLVING SYSTEM, WE PLAN TO IMPROVE 'WEATHER'
AND THE DATACOMPUTER THAT IT USES, SO THINGS THAT WORK ONE DAY MAY
GET BROKEN THE NEXT.
AVAILABILITY
'WEATHER' (THE LISP HALF) IS A REAL MACHINE USER - BOTH CPU AND CORE.
WE WOULD OBVIOUSLY PREFER THAT YOU RUN IT ON YOUR MACHINE. IF WE ARE
REALLY EMPTY (WHICH FREQUENTLY HAPPENS AT 3 AM) FEEL FREE TO RUN IT
ON OUR MACHINE. SINCE OUR SYSTEM HAS NO DRUM, AND WEATHER TAKES ALL
OF CORE, PLEASE CHECK WITH SOMEBODY AT CCA BEFORE RUNNING WEATHER IF
WE ARE DOING ANYTHING AT ALL. IT IS REALLY A MAJOR LOAD ON OUR
MACHINE.
SINCE THE DATACOMPUTER IS LESS OF A LOAD, THE SYSTEM WILL BE MORE
AVAILABLE IF YOU CAN RUN 'WEATHER' ON YOUR MACHINE. WE WILL TRY TO
RUN THE DATACOMPUTER DURING NON-PRIME TIME. IT CAN BE THOUGHT OF AS
THE SERVER HALF OF THIS COMBINATION. PRIME TIME HERE IS 1 TO 5 PM
EASTERN TIME, BUT THE MORNING IS FREQUENTLY QUITE BUSY TOO. PLEASE
DO NOT USE THE DATACOMPUTER DURING THIS TIME. IT WILL SAVE US THE
EFFORT OF TURNING IT OFF. CURRENTLY, ONLY ONE USER AT A TIME CAN RUN
- A LIMITATION OF THE DATACOMPUTER THAT WILL GO AWAY SOMEDAY.
IF YOU WANT TO RUN A DEMO (NOT JUST PLAY AROUND) WE WILL PROBABLY BE
ABLE TO COOPERATE. PLEASE CALL TO MAKE ARRANGEMENTS WELL IN ADVANCE.
WE STRONGLY RECOMMEND THAT YOU DO NOT TRY TO GIVE AN IMPORTANT DEMO
UNLESS YOU HAVE USED BOTH VERSIONS OF THE SYSTEMS THAT WILL BE USED
FOR THE DEMO. EVEN WE GET IN TROUBLE WHEN WE FORGET TO REHEARSE
THINGS A BIT.
WEDNESDAY MORNING IS SCHEDULED PREVENTIVE MAINTENANCE TIME. OUR
MACHINE WILL NOT BE UP BETWWEN 9 AND 12, BUT IT FREQUENTLY IS DOWN
PAST NOON.
MORE INFORMATION
FOR THE LATEST INFORMATION, LOOK AT *.TXT IN <WDB-DEMO> AT CCA. IN
PARTICULAR, MESSAGE.TXT SHOULD BE A QUICK SUMMARY OF ANY OTHER
INTERESTING TXT FILES. THE LATEST VERSION OF THIS FILE (IN RUNOFF
FORMAT) IS <HGM-MEMOS>ANNOUNCE.RNO.
THE LATEST VERSION OF THE LISP PROGRAM WILL BE IN <WDB-DEMO> AS
WEATHER.SAV. NORMAL FTP CAN BE USED TO COPY IT - IT IS 400+ PAGES.
IT TAKES 6 MIN TO TRANSFER IT TO OURSELVES (1.5 MIN OF CPU, 22KBAUD)
WHEN THE MACHINE IS VERY LIGHLY LOADED. BE SURE TO SET THE BYTE SIZE
TO 36 ('TENEX' COMMAND TO FTP) SINCE THIS IS A SAVE FILE.
PROBLEMS
IF YOU FIND ANY PROBLEMS WITH LISP HALF, PLEASE USE SNDMSG TO SEND
THINGS TO JEFF HILL - HE IS JMH@CCA. WE ARE INTERESTED IN EXAMPLES
WHICH LOOK LIKE THEY SHOULD WORK BUT DON'T. IF YOU HAVE ANY COMMENTS
ABOUT GENERAL OPERATIONS GLITCHES - LIKE THE DATACOMPUTER IS NOT UP -
BITCH TO ME - HGM@CCA. CCA IS HOST 31 DECIMAL, 37 OCTAL.
USING THE SYSTEM
AFTER STARTING WEATHER, THE SYSTEM WILL INITIALIZE ITSELF, TYPE OUT
SOME MESSAGES, AND ASK FOR THE JULIAN DATE (REALLY THE DAY NUMBER OF
THE YEAR OF 'TODAY'). 300 IS THE BEST ANSWER. THAT MAKES 'TODAY' BE
OCT 26, THE LAST DAY OF THE ICCC DEMO, AND ONE DAY PAST THE LATEST
DATA. IF YOU TYPE SOME OTHER NUMBER, 'YESTERDAY', AND 'LAST WEEK'
WON'T WORK AS WELL. THE SYSTEM PROMPTS WITH 'READY' WHEN IT IS READY
TO ACCEPT A SENTENCE. BE SURE TO END THE SENTENCE WITH THE PROPER
PUNCTUATION. SENTENCES MAY BE EDITED WITH:
^A (CONTROL A) -- DELETES LAST CHARACTER OR WORD TYPED
^R (CONTROL R) -- RETYPES SENTENCE SO FAR
^Q (CONTROL Q) -- DELETES ENTIRE SENTENCE
^X (CONTROL X) -- SAME AS CONTROL Q
IN ADDITION, TWO CONTROL CHARACTERS CAN BE USED DURING THE PROCESSING
OF SENTENCES, ^E (CONTROL E) ABORTS THE PROCESSING OF THE SENTENCE
AND RETURNS TO THE READY STATE. ^D (CONTROL D) ABORTS THE WHOLE
SYSTEM, AND CAUSES IT TO RE-INITIALIZE ITSELF.
DURING SENTENCE INPUT, THE SYSTEM ATTEMPTS TO LOOK UP EACH WORD AS IT
IS TYPED, AND DO SPELLING CORRECTION. HOWEVER, SINCE THE NAMES OF
ALL THE CITIES ARE KEPT AT THE DATACOMPUTER, THE SYSTEM ACCEPTS ALL
UNKNOWN WORDS AS POSSIBLE PLACE NAMES TO BE CHECKED LATER ON IN THE
PROCESSING. IF THE SPELLING CORRECTOR CORRECTS ONE OF YOUR CITY
NAMES TO A VOCABULARY WORD, YOU CAN ERASE THE WORD AND RETYPE IT
PRECEEDED BY AN EQUAL SIGN (=). THIS FORCES THE SYSTEM TO ACCEPT THE
WORD AS A PROPER NOUN.
THE SYSTEM RETURNS TO THE READY STATE AFTER ANSWERING THE QUESTION.
TO TERMINATE A SESSION YOU MAY TYPE 'GOODBYE.', WHICH WILL LEAVE YOU
BACK AT THE TENEX COMMAND LEVEL.
IF THE SYSTEM CANNOT UNDERSTAND YOUR REQUEST, IT TRIES TO GIVE YOU
SOME EXPLANATORY MESSAGE. HOWEVER, IT'S NOT VERY GOOD AT EXPLAINING
ITS BEHAVIOR SO YOU WILL USUALLY GET THE DEFAULT MESSAGE 'SORRY, I
COULDN'T UNDERSTAND THAT REQUEST.' IF YOU REALLY WANT TO KNOW WHY,
SEND US A TRANSCRIPTIPT OF YOUR SESSION.
RESTRICTIONS - HINTS
THE SYSTEM IS NOT LINGUISTICALLY SOPHISTICATED. IT PERFORMS PASSABLY
ON SENTENCES WHICH WOULD BE CONSIDERED SIMPLE REQUESTS FOR
INFORMATION. IT DOES NOT REALLY UNDERSTAND THE 'WEATHER', RATHER IT
UNDERSTANDS HOW TO COMPOSE RETRIEVAL REQUESTS FOR WEATHER DATA IN ITS
DATA BASE.
A LARGE SET OF EXAMPLE SENTENCES CAN BE FOUND IN <WDB-
DEMO>SENTENCES.TXT. THOSE SENTENCES PRECEDE BY A BACKSLASH (\) DO
NOT WORK. AN EXPLANATORY COMMENT, PRECEEDED BY A SEMI-COLON (;),
EXPLAINS THE REASONS FOR FAILURE. SOME OF THE OTHERS MAY NOT WORK
EITHER.
THE MAJOR THINGS THAT WE KNOW DON'T WORK ARE:
1. CONJUNCTION
2. NEGATION
3. COMPLEX QUANTIFICATION
4. SOPHISTICATED PRONOUN REFERENCE
5. QUESTIONS NOT (!!) INVOLVING WEATHER,E.G.,
'WHAT DAY WAS YESTERDAY?'
'WHAT CITIES ARE IN MASSACHUSETTS?'
'IT' IS NOT WORKING TOO WELL THESE DAYS. IF YOU GET IN TROUBLE, TRY
SOMETHING WITHOU 'IT' THAT IS KNOWN TO WORK. 'WHAT WAS THE
TEMPERATURE IN BOSTON YESTERDAY?' IS BETTER THAN MOST - WE ARE QUITE
APT TO TEST THAT PHRASE.
THE SYSTEM DOES NOT KNOW ABOUT THE WORD 'SNOW'. THERE IS NO WAY TO
LIST ITS VOCABULARY. HOWEVER, THE SENTENCES IN SENTENCES.TXT SHOULD
GIVE YOU A GOOD IDEA OF THE WORDS THE SYSTEM UNDESTANDS.
THE AVERAGE TIME FOR UNDERSTANDING AND ANSWERING A QUESTION IS ABOUT
1 MINUTE. IF THE SYSTEM TAKES MUCH LONGER THAN THIS YOU SHOULD GET
SUSPICIOUS. IF YOU ASK A VERY BIG QUESTION, LIKE 'WHAT WAS THE
HOTTEST PLACE IN CANADA?', THE LISP SYSTEM MAY GO INTO A GC (GARBAGE
COLLENTION) LOOP. RESTARTING ALL OVER (WITH ^D) WILL BE OK, BUT THE
DATACOMPUTER MAY TAKE A LONG TIME TO RECOVER - IT DOESN'T (YET)
NOTICE THAT ANYTHING HAS GONE WRONG UNTIL IT ASKS FOR MORE INPUT.
DATA
THE DATA FROM OCT 1 THRU 16 IS ALMOST COMPLETE. BETWEEN OCT 17 TO 25
IT IS FREQUENTLY SKETCHY. WE PLAN TO RELOAD EVERYTHING SOMETIME
USING SEPTEMBER'S DATA, SO WE WILL HAVE A COMPLETE MONTH OF CLEAN
DATA. NOTE: THIS DATA HAS BEEN EXTENSIVELY SUMMARIZED - IT IS
PROBABLY NOT USEFUL FOR PEOPLE INTERESTED IN WEATHER.
THE OCTOBER DATA IS ABOUT 2400 PAGES OF REAL DATA, PLUS ABOUT 200
MORE FOR THE INVERSION. THERE ARE ABOUT 4000 CITIES. THE CITY NAMES
HAVE BEEN CHANGED TO THE LOCAL CITY WHENEVER POSSIBLE. I.E. 'BOSTON'
IS 'BOSTON' NOT 'LOGAN'.
'ALASKA' GOT LOST SOMEHOW. DON'T BOTHER ASKING FOR IT.
SAMPLE DATALANGUAGE
THESE EXAMPLES ON THIS AND THE FOLLOWING PAGES WORK ON A VERY OLD
VERSION OF THE DATACOMPUTER. THEY ARE NOT THE FINAL FORM OF
DATALANGUAGE THAT SHOULD BA AVAILABLE SOON.
THE FOLLOWING QUESTION TO 'WEATHER' GENERATED TWO REQUESTS TO THE
DATACOMPUTER. THE FIRST USES THE CHECK PORT TO VERIFY THAT 'BOSTON'
IS A VALID CITY NAME, AND THE SECOND USES THE ANSWER PORT TO RETRIEVE
THE DATA. THE ACTUAL REQUESTS THAT ARE SENT TO THE DATACOMPUTER DO
NOT HAVE THE CARRIAGE RETURNS THAT MAKE THEM READABLE - THEY ARE SENT
AS ONE VERY LONG LINE. THE '^Z' IS THE END OF DATA MARKER.
TO WEATHER
HAS IT RAINED IN BOSTON RECENTLY?
TO THE DATACOMPUTER:
OR CHECK.PLACE,STATION WITH CITY EQ 'BOSTON'
PLACE.LP1 = '('
PLACE.CITY = STATION,CITY
PLACE.RP1 = ')'
PLACE.BSN = STATION.BSN
PLACE.LP2 = '('
PLACE.REGION = STATION.REGION
PLACE.RP2 = ')'
END
FROM THE DATACOMPUTER:
(BOSTON )725090(MASSACHUSETTS )
^Z
TO THE DATACOMPUTER:
FOR THE STATION WITH (REGION EQ 'MASSACHUSETTS' AND CITY EQ 'BOSTON')
FOR ANSWER.ANS OBSERVATION WITH GE '273' AND DATE LE '300'
ANS.LP1 = '('
ANS.CITY = STATION.CITY
ANS.RP1 =')'
ANS.DATE = OBSERVATION.DATE
ANS.LP2 = ')'
ANS.DATA = PRECIP
ANS.DATA1 = ' '
ANS.DATA2 = ' '
ANS.RP2 = ')'
END
END
AND BACK FROM THE DATACOMPUTER:
(BOSTON )293( )
(BOSTON )294( 0 )
(BOSTON )295( )
(BOSTON )296( 10 )
(BOSTON )297( 0 )
(BOSTON )298( 0 )
(BOSTON )299( 0 )
(BOSTON )300( )
^Z
AND THE FINAL ANSWER FROM WEATHER WAS 'YES.'
DESCRIPTIONS
THE FOLLOWING IS THE DESCRIPTION OF THE FILE AS IT IS STORED IN THE
DATACOMPUTER.
CREATE WEATHER FILE LIST
STATION STRUCT
BSN STR(6), I=D
CITY STR(22), I=D
REGION STR(22), I=D
WORLD STR(22)
OBS LIST (31)
OBSERVATION STRUCT
DATE STR(3)
TEMPERATURE STRUCT
MIN STR(4)
MAX STR(4)
END TEMPERATURE STRUCT
PRECIP STR(4)
WINDS STRUCT
SPEED STR(4)
GUSTS STR(4)
DIRECTION STR(4)
END WINDS STRUCT
VISIBILITY STR(4)
CLOUDS STR(4)
GENERAL STR(4)
PRESSURE
END OBSERVATION STRUCT
END STATION STRUCT
END WEATHER FILE
THE FOLLOWING PORT DESCRIPTIONS ARE USED BY WEATHER TO RETRIVE DATA
FROM THE DATACOMPUTER.
CREATE CHECK PORT LIST
PLACE STRUCT
LP1 STR (1)
CITY STR (22)
RP1 STR (1)
BSN STR (6)
LP2 STR (1)
REGION STR (22)
RP2 STR (1)
END PLACE STRUCT
END CREATE CHECK
CREATE ANSWER PORT LIST
ANS STRUCT
LP1 STR (1)
CITY STR (22)
RP1 STR (1)
DATE STR (3)
LP2 STR (1)
DATA STR (4)
DATA1 STR (4)
DATA2 STR (4)
RP2 STR (1)
END ANS STRUCT
END CREATE ANSWER
SAMPLE RECORD
THE FOLLOWING DATA IS SAMPLE RECORD - THE FIRST ONE IN THE FILE. IN
THE DATACOMPUTER IT IS STORED WITHOUT THE CARRIAGE RETURNS NEEDED TO
MAKE IT PRINT NEATLY.
010100ANDOYA NORWAY EUROPE
275
276 278 279 0 2 0 12 9 758883023
277 279 283 4 23 52 11 8 788882982
278
279 274 281 0 23 42 12 9 788882976
280 268 270 4 14 0 9 19 627772983
281 276 283 0 28 69 7 9 688882986
282 269 273 4 10 0 9 22 727772975
283 278 284 0 13 37 9 7 766882972
284 0 28 0 5 1 80006 0
285 276 280 0 9 0 3 12 666682936
286 273 275 0 15 0 3 7 877782960
287 278 282 0 13 28 11 14 626662981
288 269 271 28 30 0 6 7 888882908
289 275 281 0 21 45 7 9 688882976
290 268 269 0 20 0 8 22 622782960
291 283 283 0 11 0 7 47 100013002
292 274 274 0 11 0 1 47 500012996
293
294 272 272 0 5 0 7 47 400012949
295
296
297 273 273 0 0 0 1 6 80007 0
298 273 273 0 0 0 1 2 80007 0
299 275 275 0 6 0 7 31 30000 0
300
301
302
303
304
305
[This RFC was put into machine readable form for entry]
[into the online RFC archives by Helene Morin, Via Genie, 12/99]