

Current Trouble

Failure of network switch (already recovered)

Publication date: Sep. 17, 2020


To all supercomputer users

Due to a network switch failure, storage performance has been deteriorating intermittently since 11:20 a.m. on September 17.
We are currently working to restore service and appreciate your patience.


Added: 2020/09/17 18:00

At 18:00, storage response returned to normal.
As a result of this failure, the following jobs on Systems B and C were either re-executed or terminated abnormally.

System B: List of aborted jobs

5457087 5457088 5457102 5457108 5457114 5456960 5457268
5457594 5457090 5457097 5457098 5457111 5454008 5455948
5457070 5457117 5454012 5457076 5454019 5457072 5457077
5457122 5457076 5453976 5457071 5457071 5455945 5455974
5457109 5456015 5457272 5456446 5456030 5457080 5454138
5456042 5454138 5453967 5457227 5456076 5456455 5456457
5456458 5456086 5456449 5456051 5454407 5456450 5456432
5456000 5456431 5455819 5455728 5456065 5456435 5456448
5456092 5455090 5457130 5456058 5456003 5456451

System B: List of re-executed jobs

5453962 5456638 5456609 5456605 5456619 5453954 5456278
5457263 5457113 5457116 5456919 5452413[13] 5452413[15] 5452413[17]
5452413[18] 5452413[19] 5454210 5456640 5456604 5456908 5402296
5456610 5456980 5454200 5446348 5436326 5451663 5451665
5451669 5451673 5451674 5456641 5456636 5457059 5456596
5453956 5447353 5452308 5425769 5445673 5457061 5456913
5456634 5456637 5456925 5402445 5402453 5456911 5423841
5423841 5402464 5402454 5456555 5457393 5456538 5457394
5457395 5456549 5457396 5457397 5456547 5454197 5457527
5457528 5457529 5457530 5457531 5457532 5457533 5457534
5457535 5456909 5456901 5454035 5457221 5456983 5402430
5456628 5456618 5402459 5402442 5456627 5402422 5402424
5456935 5457083 5457435 5457436 5457437 5457438 5457439
5457440 5457441 5457442 5457443 5457444 5457446 5457447
5457448 5457449 5457450 5457451 5457452 5457453 5445678
5445679 5454257 5454687 5442067 5456244 5457342 5457343
5456265 5456239 5457343 5457344 5456260 5457079 5457151
5457151 5457234 5456968 5456494 5456502 5456484 5454795
5454830 5456505 5454842 5456488 5456495 5457482 5457483
5457485 5456427 5457486 5457487 5457488 5457489 5457107
5457110 5457100 5457101 5456914 5456912 5457150 5457166
5457167 5457168

System C: List of re-executed jobs

415060 414996 415106

Added: 2020/09/23 8:40:00

Additional System B jobs that terminated abnormally have been added to this notice.
We apologize for the delay in publishing this information.
The additional abnormally terminated jobs are as follows.

System B: List of aborted jobs

5424023 5454073 5454517 5454928 5455562 5402362 5446648
5455000 5454685 5454969 5455956 5454513 5455778 5454673
5455936 5456315 5456327 5456528 5456021 5456890 5456955
5453948 5453955 5457214 5457217 5457212 5457223 5454171
5445314 5457382 5457224 5457557 5456507 5457508 5454123
5456237 5454834 5457160 5454080 5457495 5456248 5455718
5455726 5455720 5455750 5457173 5457197 5454841 5454800
5454825 5457412 5457352 5455738 5457362 5457380 5457473
5455741 5454822 5454801 5456548 5456222 5457399 5455814
5457456 5454837

The system has been restored. We apologize for any inconvenience this may have caused.
Date of occurrence: 2020/09/17 11:20 ~ 2020/09/17 18:00
Inquiry: Supercomputing Section, IT Services Division, Information Management Department, Kyoto University
E-mail: consult@kudpc.kyoto-u.ac.jp


Copyright © Institute for Information Management and Communication, Kyoto University, all rights reserved.