Raj Kettimuthu put together a Web page with extracts from a few emails that have circulated from happy GridFTP users. Perhaps this will spur others to tell their stories, good or bad (we're interested in both).
« June 2008 | Main | August 2008 »
Raj Kettimuthu put together a Web page with extracts from a few emails that have circulated from happy GridFTP users. Perhaps this will spur others to tell their stories, good or bad (we're interested in both).
Posted by Ian Foster on July 21, 2008 | Permalink | Comments (0) | TrackBack (0)
Reblog
(0)
| |
|
|
I was invited to speak at the University of Chicago's Physics Colloquium. I foolishly agreed and then had to work out what I could possibly say that would be of interest to physicists. I figured that a talk on distributed computing wouldn't be too interesting. Instead, I gave a more general talk with the grandiose title "Computation and Knowledge":
I speak to the question of how computation can contribute to the generation of new knowledge by accelerating the work of distributed collaborative teams and enabling the extraction of knowledge from large quantities of information produced by many workers. I illustrate my presentation with examples of work being performed within the Computation Institute at the University of Chicago and Argonne National Laboratory.
They seemed to enjoy it, and I enjoyed preparing it.
Posted by Ian Foster on July 18, 2008 in Science | Permalink | Comments (0) | TrackBack (0)
Technorati Tags: Computation, Knowledge
Reblog
(0)
| |
|
|
We all know that the US patent system has its weaknesses: that every now and then, people get granted patents for things that are either well known or obvious. This probably explains why people keep filing dumb patents: you never know when you might hit the jackpot.
In any case, here's a doozy: following on some 40 or more years of distributed computing research, and 15+ years of commercial and academic grid computing, IBM has finally gotten around to applying for a patent on grid computing. For example:
a computer-implemented method of providing access to grid computing resources available to a plurality of users comprises receiving, from a requesting entity, a request to use a specific grid computing resource to perform a defined function; and routing the request to the specific grid computing resource in accordance with the request.
If only Condor, Globus, Sun Grid Engine, PBS, LoadLeveler (oops, that's an IBM product), Platform LSF, etc., etc., had thought of that! Wait, they did, a decade or more ago. Sigh ...
Posted by Ian Foster on July 16, 2008 in Law | Permalink | Comments (1) | TrackBack (0)
Technorati Tags: IBM, patent, software patent
Reblog
(0)
| |
|
|
I gave a brief talk at HPC 2009 in Cetraro, Italy, on Grid Projects in the US. I tried to explain what I see as three complementary sets of activities:
As with any attempt to categorize complex activities, the divisions are not entirely accurate. For example, the cancer biomedical informatics grid (caBIG) project operates resources, host services, and provides content. But in recent work, caBIG has started making use of TeraGrid resources.
Posted by Ian Foster on July 15, 2008 in Events | Permalink | Comments (0) | TrackBack (0)
Technorati Tags: caBIG, globus, grid, Open Science Grid, OSG, TeraGrid, US grid projects
Reblog
(0)
| |
|
|
Tim Freeman writes about how to create one click clusters using the Globus workspace service.
Univa UD describes how to create UniCluster clusters on EC2.
We wrote a paper about on-demand clusters a while back. It's how all clusters should be created :)
Posted by Ian Foster on July 10, 2008 in Globus | Permalink | Comments (0) | TrackBack (0)
Technorati Tags: Cloud, Globus, Grid, UniCluster, Univa UD, Workspace
Reblog
(0)
| |
|
|
I mentioned the upcoming Workshop on Many Tasks Computing on Grids and Supercomputers. (Submission deadline: August 15.) What, you may wonder, do we mean by "many tasks computing"?
I (and my co-organizers Ioan Raicu and Yong Zhao) use the term to denote high-performance computations comprising multiple distinct activities, coupled via (for example) file system operations or message passing. Tasks may be small or large, uniprocessor or multiprocessor, compute-intensive or data-intensive. The set of tasks may be static or dynamic, homogeneous or heterogeneous, loosely or tightly coupled. The aggregate number of tasks, quantity of computing, and volumes of data may be extremely large.
Were we right to coin a new term, multi task computing, to denote such applications? There are certainly alternatives that we could have used instead. For example:
Are we making a useful distinction in using the term "many task computing" rather than one of those above, or just engaging in unnecessary neologism? Tell me what you think!
Perhaps we could simply have said: applications that are communication-intensive but are not naturally expressed in MPI. In that sense (and this is really the primary goal of the workshop) we are simply drawing attention to the many computations that are heterogeneous but not "happily parallel." Such computations can arise for a variety of reasons, such as:
In any case, we hope to see submissions from people working on high throughput computing, data-intensive scalable computing, and any other sort of high-performance computing that isn't conventional SPMD.
Posted by Ian Foster on July 09, 2008 in Events | Permalink | Comments (3) | TrackBack (0)
Technorati Tags: capacity computing, high throughput computing, HTC, many tasks computing, MPMD, MTAGS
Reblog
(0)
| |
|
|
I spent a wonderful week in Cetraro, Italy, at the 9th HPC Conference. A lovely location, exceptional colleagues, and fascinating presentations and discussions on various topics relating to HPC and Grid. Prof. Lucio Grandinetti organizes a wonderful event.
A focus this year on hardware--people showing off their impressive close-to-petascale computers and talking about plans for exascale. Lest people get too proud, Jack Dongarra reminded us that on average it takes 6-8 years for the number one system on the Top 500 list to fall off the bottom. Some discussion of GPGPUs (but does anyone believe that they won't be replaced by general-purpose multicores within a few years?). Also much discussion concerning power management--but not clear how serious anyone is about the topic. (E.g., no real discussion of what tradeoffs will be made to reduce power consumption, or mention of genuinely low-power systems, such as SiCortex.)
Considerable discussion of "clouds" (whatever they may be) much of it naive and ill-informed. Fascinating to hear the same expansive expectations expressed (without any apparent doubts) for "clouds" as we heard five years ago from the most fullsome proponents off grids. "Soon, applications of all sorts will be hosted in the cloud." Well maybe, but big legal, business model, and sociological barriers have so far hindered hosting of core business applications as services, and those barriers seem slow to fall. "Clouds offer arbitrarily scalable computing." Scalable to infinity, for any value of infinity less than a few hundred (at least at present). "Clouds offer much simpler interfaces than grids." There isn't much to choose between say EC2 and Globus Workspace Service from the interface perspective.
Perhaps the most thoughtful remarks on these topics were from Ignacio Llorente, who is working with relevant technologies via his Globus GridWay and OpenNebula projects. He characterized clouds as "a paradigm for the on-demand provision of virtualized resources as a service" (correctly identifying virtualization rather than interfaces as the key advance) and grids as "the technology that will allow for cloud interoperability." It will be interesting to see whether cloud interoperability emerges as an important requirement. In the grid space, a lack of interoperability has so far proved to be more of an irritant than a real obstacle to progress.
Discussion of the European Grid Initiative, a proposal to create a pan-European grid linking national grids within each EU state. A curious lack of discussion concerning the application drivers for this new infrastructure, or their requirements. Is EGEE (focused on distributing jobs to federated clusters) really the right model? Why no discussion of data federation (a big driver for grid computing in the UK and US), services (at the heart of successful grids such as caBIG, BIRN, and Earth System Grid), or the role of supercomputer centers (surely major "powerplants"?). The software strategy was striking in its lack of vision--keep supporting three separate European middleware platforms (ARC, gLite, Unicore), and attempt to integrate them (to what end?), while ignoring the Condor and Globus software used by so many Europeans. Sounds more like "jobs for the boys" than a strategy for supporting European eScience. Let us hope for an outbreak of vision among European grid leaders.
Posted by Ian Foster on July 05, 2008 in Events | Permalink | Comments (2) | TrackBack (0)
Technorati Tags: Cetraro, Clouds, EGI, HPC 2008, Lucio Grandinetti
Reblog
(0)
| |
|
|
Ioan Raicu, Yong Zhao, and I are running a Workshop on Many Tasks Computing on Grids and Supercomputers (MTAGS) at SC'08 (Austin, Texas, November 17, 2008). The deadline for submissions is August 15.
Our goal in convening this workshop is to encourage discussion among those interested in escaping the "SPMD rut" that characterizes much work in parallel, if not grid, computing. We believe that new applications and more powerful computers are spurring increased interest in computations involving many separate tasks, loosely or tightly coupled, often linked via a global name/file space. These computations may be no less demanding of computing, file systems, and networks than SPMD computations, but have important differences that may require new tools. Topics of interest include:
Compute resource management
- Scheduling
- Job execution frameworks
- Local resource manager extensions
- Performance evaluation of resource managers in use on large scale systems
- Challenges in running many-task workloads on HPC systems
Data Management:
- Data-Aware Scheduling
- Shared File System performance and scalability in large deployments
- Distributed file systems
- Data caching frameworks and techniques
Large-Scale Workflow Systems
- Workflow system performance and scalability analysis
- Scalability of workflow systems
- Workflow infrastructure and e-Science middleware
- Programming Paradigms and Models
Large-Scale Many-Task Application
- Large-scale many-task applications
- Large-scale many-task data-intensive applications
- Large-scale high throughput computing (HTC) applications
- Quasi-supercomputing applications, deployments, and experiences
Posted by Ian Foster on July 04, 2008 in Events | Permalink | Comments (0) | TrackBack (0)
Reblog
(0)
| |
|
|
iGSTW has a nice article on the marvel that is GridFTP (and a nice photo of Raj Kettimuthu). The article says "these features will be in GT4.2, to be released in summer 2008." GT4.2 was released on July 2.
Posted by Ian Foster on July 03, 2008 in Globus | Permalink | Comments (0) | TrackBack (0)
Reblog
(0)
| |
|
|
Charles Bacon, GT integration lead, writes: "On behalf of the Globus Toolkit development team I am pleased to announce that a new stable release of the Globus Toolkit is now available. GT4.2.0 contains an upgrade to the web services specifications used by the toolkit [to the final WSRF and WS-Notification specifications] as well as new features in all services. New users are encouraged to use the 4.2.0 release. Existing users may wish to evaluate the new software while maintaining their existing installations; due to the specification upgrade, the web services are incompatible with the 4.0.x series. Details on the spec upgrade are available in the release notes.
Relevant 4.2.0 links:
- Release notes: http://www.globus.org/toolkit/docs/4.2/4.2.0/rn/release_notes.html
- Software: http://www.globus.org/toolkit/downloads/4.2.0/
- Documentation: http://www.globus.org/toolkit/docs/4.2/4.2.0/
Thanks for your support of Globus software!"
Posted by Ian Foster on July 02, 2008 in Globus | Permalink | Comments (0) | TrackBack (0)
Technorati Tags: Globus, GT4.2
Reblog
(0)
| |
|
|
| Sun | Mon | Tue | Wed | Thu | Fri | Sat |
|---|---|---|---|---|---|---|
| 1 | 2 | |||||
| 3 | 4 | 5 | 6 | 7 | 8 | 9 |
| 10 | 11 | 12 | 13 | 14 | 15 | 16 |
| 17 | 18 | 19 | 20 | 21 | 22 | 23 |
| 24 | 25 | 26 | 27 | 28 |
Recent Comments