Minutes of the ITC-Research Computing Group Meeting
December 21, 1998 at 10:00 AM in Astronomy 117

Members: Chris, Dawn, Dee, Ed, Hamp, Jacques, Jim J., Mark S., Sue Ellen, Tim S., Tim T., Tom S.,
Conveners:Tim T.
In Attendance: Dawn, Dee, Ed, Hamp, Sue Ellen, Tim S., Tim T., and Tom S.
Recorder: Tim T.
This webpage by Tim T.
Next Meeting: January 25 at 10 AM in Astronomy 117.

To Dos & Old Business:

Numbers in parenthesis match the numbers on the To-Do Table
of November 24, 1998 that we reviewed at this meeting. 

(1) Discussion on Orange Cluster and (2) SGI Origin200s policies.
------------------------------------------------------------------

Dee (for Jim): Remember that dollars for these computers are general
purpose computers - not just research computing, so access has to open.

TMS: Should accounts on blue automatically get orange? 

EHC: If we say that, they will come and compute.

TAS: 3 classes of rules, I thought:
New Suns -- cluster of batch machines, the existing ones became interactive
	with "police" running on them.

EHC: We have $55K for cluster expansion.  One issue is how many more nodes
blue needs; may need 2-3 nodes @ 5K each (Model 140).

TAS: We have 3 orange 200's - now called SP - in NIS domain for SP put
into load leveller, for long run jobs. GCG will be interactive with one
being front end to others because GCG licensed only for SGI.

EHC: concern about SGI200's, for folks doing NPACI stuff NCSA that's all
SGI - need some place here for development.  If we split them up into
interactive and batch, then hard for them to do work.
What if merge SP and unixlab, Sun, SGI accounts into one account.

TAS: need place to run parallel jobs.

TMS: The SP upgrade that's coming will have 8 parallel nodes.

EHC: Only 1 person needed more than 1 hours (of users moved from watt)
 moved him to the SP, which has newer nodes that are faster.

TAS: OK, two classes of users: 
	any one can get a blue.unix 
	anyone needing SGI, Sun for visualization work or the SP for
	 long running jobs. Graduate and Undergrads need faculty sponsor.

EHC: Simple and easy to do, will reduce the number of accounts
databases.  Have SPUnix lab be a subset of blue.unix so 1
userid/password. This is Unix Systems goal.

TO DO: (TFT) need to write-up this definition of classes and users
	for review. 
		
DMA: How can we map disk space for those with 2 accounts to shared disk
space.  There's about 250 SP users and 1000 Unixlab accounts to merge to
the Home Directory. Do we need to migrate those to Home Dir?

EHC: Planning on doing it Jan 5.

TFT: Timeline for ordering computers.

EHC: First determine blue.unix nodes needed. I need to order by mid
January.  Plan to get rid of SP/Unixlab distinction in late spring.

(8) SP upgrade
--------------

EHC: New nodes are going in on the SP right now. 12 done so far.
replacing slower ones with 160 MHZ chips and more memory and
re-distributing disks. So extra disk on each node for local storage on node
that's running job,  so won't need /bigtemp.

All nodes will be will have at least 512K memory. The wide nodes will
have1 GB memory. 

When upgrade is done will have 4 nodes at 120 MHZ, and the rest at 160
MHZ.  These "new" nodes are three to four times faster than existing
ones.

EHC: For now we'll designate the four 120 MHZ nodes as parallel and can
add to these and let them be ones with mix of 120 and 160 MHZ nodes.

TMS: We told Engineering we will create parallel pool, so have to
reserve nodes; create parallel pool of 8 nodes.

TAS: It'll be better to advertise what update we've done, but not the
number of nodes. Talk in terms of throughput, megaflops, etc. 

TAS to TMS: The Biomedical Engineering parallel pool not being used.
 TODO: (TAS) will write Mitch about this.

EHC: If/when we get the new SP switch we will have to take the whole SP
down.  Could create interactive nodes then.

(4) ADSM
---------

EHC: ADSM - new IBM agreement, it's now under "TIBLY" (spelling) suite.
#1->We know we have to get to get rid of maggie. and
#2->We've got to use something better than we had.
So let's get the latest/greatest before consortium runs out October,
1999. That way we can proceed. 
 Start in January to move from maggie to HSM  And let's not call
it "archive".

TMS: (to EHC) let's plan to add a tape robot to the budget for
1999-2000.

(6) CGC/SGI users
-----------------
No GCG/SGI issues

(3) Home Directory
-------------------

EHC: Home Directory, can lease disk space for an annual fee.  How much
is not back from cost accounting yet, likely to be about $63 per year
for 250MB.  Are in process of annnouncing.

EHC: need to advertise the new HSM correctly so users know what it is.
It is tape; so know users know they are getting files from tape.  As
differentiated from disk wedges, diskspace is yours - never run out, if
you do, you can buy more.

TMS: Have we considered sell backend storage to the library?  

EHC: Possibly.  When maggie proved unacceptable; they put their files on
CD and use a CD tower for PC access; so Unix may no longer be
appropriate for them.

 TODO: Need to document differences between /longtmp, HSM and disk wedges
that uses can purchase.

TMS/EHC: just leaves these as they are, users can go to either HSM or
disk wedge. Don't expand the "/longtmp" pool of disk.

 TODO:(EHC) will move /longtmp to HSM server; to HSM SSA array from
SEAS SSA array.  

EHC: The HSM disk cache is RAID array. Will continue public dataset
(/public and ICPSR). They'll go to HSM on separate file system from the
users.

TODO: (TFT) Reconvene ADSM group in January.
EHC: Think it'll be ready to go in January.
EHC: Possibly use ADSM as way to handle unix backups.

Report from UCIT and ACAC research computing subcommittees. 
-----------------------------------------------------------

TAS: UCIT sub-committee proposing to set up ACHS style research computing
support center. Working on first draft of this proposal.

TFT: ACAC sub-committee discussed issues long term and short goals for research computing
support. Headed up by Steve Stern.  Meet again in January.

Jump to III New Business:

Discuss Research Computing Support Center:

TEH: Let's consider using Small lab as Research Computing Support Center
(RCSC) place. The front of room is mostly empty. Lab has console and 28
workstations.  Seems like it's only about half used.

TMS/TAS/EHC: Problem is backroom is not available space to us, basement
isn't either.

EHC: configure old hobbes, poe, and ptolemy to be switchless serial
nodes?

Agreed to move this meeting to 10 AM Mondays for 1999. Last Monday of
the month.

Meeting adjourned, next meeting January 25, 1999, 10:00 AM =====================

<== Go to: ITC Research Computing Committee Home Page