1. HOME
  2. Information
  3. Current Trouble
  4. Failure of Network Switch (already recovered)

コンテンツ

Current Trouble

Failure of Network Switch (already recovered)

publication date : Nov.24, 2020


To all supercomputer users

Due to the failure of the network switch, the storage responsiveness has been deteriorating intermittently since around 15:50 on November 24.
We are currently working on the restoration.


Added: 18:20

The problem was fixed at around 17:30. Some jobs in System B/C have been forced to terminate/restart.
Affected Job IDs will be posted on this page at a later date, so please be patient.


Added: 11/25 8:45

System B: Job list of Abnormally terminated during the failure occurrence.

6109922 6157918 6159355 6229172 6230677 6259234 6259818
6260316 6261556 6326383 6334238 6360261 6361272 6387970
6396965 6405575 6411393 6414711 6434414 6438894 6438900
6438924 6439365 6458831 6458844 6458913 6458926 6459063
6461939 6466450 6466457 6466620 6466646 6470764 6470918
6470924 6470934 6470937 6471987 6472016 6474112 6474639
6476231 6476348 6476357 6476438 6476444 6476445 6476453
6476480 6476482 6476486 6476487 6476488 6476491 6476495
6476505 6476564 6476570 6476581 6476636 6476665 6476713
6476819 6476820 6476833 6476834 6476839 6476846 6476848
6476853 6476911 6476926 6476929 6476930 6476932 6476935
6476937 6476971 6477067 6477068 6477069 6477070 6477072
6477073 6477078 6477079 6477088 6477318 6477329 6477385
6477389 6477397

System B: Job list of rerun after recovery from the failure.

5730284 6031199 6141093 6226692 6287585 6287598 6287609
6287625 6287632 6287637 6287638 6287658 6293468 6311921
6311927 6329763 6331528 6335720 6335730 6335732 6335736
6335746 6335778 6335831 6335870 6335891 6335909 6335918
6335924 6335944 6349493 6354917 6381705 6387369 6388656
6388963 6389482 6390873 6392857 6398440 6409509 6409515
6409526 6409539 6409557 6409584 6409613 6409644 6409650
6409704 6409717 6410737 6410755 6410787 6410835 6410859
6410883 6415014 6429661 6431141 6454662 6462614 6462638
6462641 6464054 6472560 6472566 6472573 6476241 6476374
6476466 6476500 6476506 6476544 6476730 6476733 6476734
6476739 6476755 6476777 6476946 6476988 6477013 6477018
6477051 6477060 6477105 6477223 6477275 6477280 6477284
6477286 6477327 6477330 6477410 6477412 6477413 6477451

System C: Job list of Abnormally terminated during the failure occurrence.

421472

System C: Job list of rerun after recovery from the failure.

421234 421235 421236 421237 421295 421304 421305
421436 421454 421457 421458 421459 421460
The system was restored. We apologize for the inconvenience and trouble that you may have had.
Date of occurrence 2020/11/24 15:50 ~2020/11/24 17:30
Inquiry Supercomputing Section, IT Services Division, Information Management Department, Kyoto University
E-mail:consultkudpc.kyoto-u.ac.jp
Inquiry Form

Back to Current Trouble

 

Copyright © Institute for Information Management and Communication, Kyoto University, all rights reserved.