About ABCI

OPERATION STATUS

Current Operation Status and Problem Status

(As of 2022/05/24)

Start Date Time End Date Time The Current Operation Status and the Problem Status
2022/05/24 (Tue) 14:20 - The ABCI Services are available. See for Known Issues.

Future Operation Schedule

(As of 2022/05/20)

Start Date Time End Date Time The Future Operation Schedule
(Planned schedule may be changed.)
2022/06/15 (Wed) 13:30 06/21 (Tue) 13:00 ABCI including ABCI User Portal, is out of service due to the maintenance of ABCI system and water cooling system.
Mid of Oct Several days Due to the “ABCI Grand Challenge 2022#2” some of the Compute Nodes unavailable.
Early Dec Several days Due to the “ABCI Grand Challenge 2022#3” some of the Compute Nodes unavailable.
Mid of Dec Several days ABCI is out of service due to the maintenance of ABCI water cooling system.

Past Down History

(As of 2022/05/24)

Start Date Time End Date Time The Past System Down History
2022/05/24 (Tue) 13:00 14:20 The Home Area is inaccessible.
2022/05/18 (Wed) 11:00 2022/05/18 (Thu) 11:00 The “ABCI Grand Challenge 2022#1” restricts Compute Node(A). The other nodes are available.
2022/05/19 (Thu) 12:00 2022/05/20 (Fri) 12:00 The “ABCI Grand Challenge 2022#1” restricts Compute Node(V). The other nodes are available.
2022/05/12 (Thu) 17:40 18:57 The Home Area is inaccessible.
2022/05/12 (Thu) 10:15 11:05 The Home Area is inaccessible.
2022/05/11 (Wed) 15:06 17:08 The Home Area is inaccessible.
2022/05/10 (Tue) 19:00 05/11 (Wed) 0:05 The Home Area is inaccessible.
2022/05/09 (Mon) 18:50 23:45 The Home Area is inaccessible.
2022/04/21 (Thu) 11:00 04/22 (Fri) 10:00 Due to the maintenance of the ABCI User Portal, it is not available.
2022/04/06 (Wed) 17:00 2022/04/11 (Mon) 14:35 The Singularity Endpoint does not have a functioning the remote build feature.
Please see User Guide for alternative methods.
2022/04/06 (Wed) 17:00 2022/04/09 (Sat) 21:30 ABCI Cloud Storage service has some failures. Some users cannot access it.
Note: An email has been sent to users with instructions on how to restore the system.
2022/04/06 (Wed) 17:00 04/07 13:30 ABCI User Portal has some failures. Applications can be submitted, but there is a delay in the approval process.
2022/04/01 (Fri) 0:00 2022/04/06 (Wed) 17:00 ABCI is out of service due to the fiscal annual renewal maintenance.
- ABCI User Portal will also be down.
- Please understand that responding to questions to qa@abci.ai will be slower than usual.
- Running jobs will be forced to be killed.
- If the same job is resubmitted after the maintenance, it may behave differently than before since System Updates will be performed during the maintenance.
2022/03/30 (Wed) 17:00 2022/03/31 (Thu) 17:00 520 of the Compute Nodes (V) are unavailable due to maintenance. Other Computate Nodes (V) and (A) are available.
2022/03/19 (Sat) 10:00 18:30 Network communication between ABCI and INTERNET will be down multiple times.
· Connection to Interactive nodes or to ABCI User Portal will be broken.
· The batch jobs with the INTERNET access will be affected. Example: Communicating with a license server outside of ABCI, using ABCI cloud storage, etc.
2022/03/17 (Thu) 9:00 13:40 ABCI User Portal is out of service. The other services are available.
2022/03/16 (Wed) 0:00 6:00 Network communication between ABCI and INTERNET will be down multiple times.
· Connection to Interactive nodes or to ABCI User Portal will be broken.
· The batch jobs with the INTERNET access will be affected. Example: Communicating with a license server outside of ABCI, using ABCI cloud storage, etc.
2022/03/01 (Tue) 8:30 2022/03/03 (Thu) 17:00 ABCI is out of service due to the maintenance of the water cooling system and update. ABCI User Portal also stopped for maintenance.
Please understand that responding to questions to qa@abci.ai will be slower than usual.
See "System Updates" for the software updates on this maintenance.
2022/01/11 (Tue) 0:00 1:00 Network communication between ABCI and INTERNET will be down.
Unable to access to Interactive nodes and ABCI User Portal.
The batch jobs with the INTERNET access will be affected. Example: Communicating with a license server outside of ABCI, using ABCI cloud storage, etc.
2021/12/27 (Mon) 20:10 2021/12/30 (Thu) 11:39 The /projects area of the storage is failure.
It will be umount from 12/28 14:00, and it is expected that it will take several days to recover.
2021/12/10 (Fri) 13:00 2021/12/15 (Wed) 17:00 ABCI is out of service due to the electric power outage and the maintenance of the water cooling system.
See "System Updates" for the software updates on this maintenance.
2021/12/08 (Wed) 10:00 2021/12/10 (Fri) 13:00 The “ABCI Grand Challenge 2021#3” restricts Compute Node(A). The other nodes are available.
2021/09/14 (Tue) 11:00 2021/09/22 (Wed) 11:00 The “ABCI Grand Challenge 2021#2” reduces compute node(V) to 440 nodes. The other nodes are available.
2021/08/11 (Wed) 10:00 2021/08/12 (Thu) 15:00 ABCI is out of service due to the maintenance of ABCI. During this time, the mail server will also be stopped, so we will not be able to respond to inquiries to qa@abci.ai.
After this maintenance, /groups1/ and /fs3/ will be set to read-only.
2021/07/06 (Tue) 17:00 2021/07/19 (Mon) 11:00 Remote build of SingularityPRO is not available.
2021/07/02 (Fri) 13:00 2021/07/06 (Tue) 17:00 ABCI is out of service due to the maintenance of ABCI water cooling system.
2021/06/17 (Thu) 13:00 2021/06/18 (Fri) 13:00 The “ABCI Grand Challenge 2021#1” restricts compute node(A). The other nodes are available.
2021/06/16 (Wed) 12:00 2021/06/17 (Thu) 12:00 The “ABCI Grand Challenge 2021#1” reduces compute node(V) to 440 nodes. The other nodes are available.
2021/05/08 (Sat) 0:00 1:30 Network communication between ABCI and INTERNET will be down.
Unable to access to Interactive nodes, ABCI User Portal, and Q&A service of qa@abci.ai.
The batch job services without INTERNET access, such as cloud storage, will not be affected.
2021/04/01 (Thu) 0:00 2021/04/07 (Wed) 10:00 ABCI is out of service due to the ABCI maintenance, including ABCI user portal and Q&A service of qa@abci.ai.
We will stop providing the service of Singularity 2.6 on ABCI at the end of March, 2021. Please migrate to SingularityPRO 3.5.
2021/03/12 (Fri) 8:00 2021/03/13 (Sat) 15:00 ABCI is out of service due to the ABCI maintenance, including ABCI user portal and Q&A service of qa@abci.ai.
The number of ABCI interactive nodes, “es”, will be reduced from 4 to 2. The es3 and es4 will be withdrawn.
2021/03/05 (Fri) 8:00 2021/03/08 (Mon) 9:00 Due to the rack maintenance, only 374 compute nodes are available.
The other services are available.
2021/03/03 (Wed) 13:30 17:15 The insternal DNS server was not responding.
2021/02/26 (Fri) 8:00 2021/03/01 (Mon) 9:00 Due to the rack maintenance, only 544 compute nodes are available.
The other services are available.
2021/01/27 (Wed) 15:10 16:20 Job Execution Service was unavailable.
2021/01/05 (Tue) 10:30 16:30 Job Execution Service was unavailable.
2020/12/11 (Fri) 13:00 12/15 (Tue) 17:00 ABCI is out of service due to the electric power outage.
2020/12/10 (Thu) 12:30 12/11 (Fri) 13:00 All of the compute nodes and the memory-intensive nodes are going to be out of service due to the ABCI Grand Challenge 2020#3. The other ABCI services, such as interactive nodes, ABCI user portal, and Q&A service of qa@abci.ai are available.
Please refrain from too much frequent access to storage.
2020/11/30 (Mon) 3:00 4:00 Due to the network maintenance, network communication between ABCI and INTERNET will be down for at max 60 minutes. ABCI User Portal and access to the interactive nodes will be unavailable.
Since ABCI internal network is available, the batch job services will not be affected.
2020/11/25 (Wed) 20:15 23:40 Emergent maintenance for GPFS ( fs1, bb ) and UGE. All ABCI services stopped.
2020/11/25 (Wed) 10:45 20:15 Unable to access to GPFS ( fs1, bb ) and UGE.
2020/11/25 (Wed) 10:45 11:25 Unable to access to interactive nodes.
2020/10/9 (Fri) 9:00 14:30 Unable to use SingularityPRO due to update. The other services are available.
2020/9/08 (Tue) 12:35 21:49 Unable to access to GPFS ( fs1, fs2, fs3, bb ).
2020/8/28 (Fri) 13:00 21:30 ABCI is out of service due to system update.
2020/8/26 (Wed) 12:00 8/28 (Fri) 12:00 All nodes of computing server are restricted due to the “ABCI Grand Challenge 2020#2”.
2020/8/25 (Tue) 12:00 8/26 (Wed) 12:00 The “ABCI Grand Challenge 2020#2” reduced 522 nodes of computing server. Job execution may be delayed due to congestion.
2020/6/2 (Tue) 0:00 14:00 ABCI Cloud Storage is unavailable to all ABCI nodes due to an internal DNS problem.
2020/6/1 (Mon) 13:00 21:30 Due to an OpenSSL issue, aws-cli/1.16.194 and aws-cli/1.18 don’t work with ABCI Cloud Storage. aws-cli/2.0 works.
2020/5/29 (Fri) 13:00 6/1 (Mon) 13:00 ABCI is out of service due to the maintenance of ABCI water cooling system.
2020/5/28 (Thu) 12:00 5/29 (Fri) 13:00 The “ABCI Grand Challenge 2020#1” reduced 522 nodes of computing server. Job execution may be delayed due to congestion.
2020/5/28 (Thu) 6/01 (Mon) 10:50 User Registration system on the ABCI User portal was not functioning.
2020/5/2 (Sat) 16:20 5/3 (Sun) 19:53 All the compute nodes were unavailable(Jobs will not run).
2020/4/1 (Wed) 09:00 4/3 (Fri) 20:00 Due to the maintenance, all services will be unavailable(qa@abci.ai will also be unavailable).
2020/4/1 (Wed) 00:00 4/3 (Fri) 20:00 All the compute nodes were unavailable(Jobs will not run). ABCI User Portal will be unavailable.
2020/3/10 (Tue) 15:50 18:42 Due to the trouble in the storage system, /home was not responding.
2020/2/29 (Sat) 09:00 20:00 Network Maintenance(short interruptions of the internet access from/to ABCI).
2020/2/28 (Fri) 10:00 14:00 ABCI User Portal is unavailable due to the planned maintenance.
2020/2/21 (Fri) 20:50 22:03 Due to the trouble in the storage system, interactive nodes and jobs were affected.
2020/2/13 (Thu) 13:55 2/17 (Mon) 15:30 Due to the trouble in the cooling system, 34 compute nodes(g0851-g0884) stopped.
2020/1/22 (Wed) 13:00 18:00 ABCI Cloud Storage Service and ABCI User Portal stopped due to the maintenance. ABCI interactive/compute nodes, home/group area and qa@abci.ai mail service are available.
2019/12/24 (Tue) 13:03 14:06 /home was not responding.
2019/12/20 (Fri) 22:38 12/22 (Mon) 11:56 /home was not responding.
2019/12/19 (Thu) 12:59 14:44 /home was not responding.
2019/12/13 (Fri) 13:00 12/17 (Tue) 13:00 ABCI is out of service due to the maintenance of electricity in Kashiwa-site.
2019/12/13 (Fri) 09:00 12/17 (Tue) 13:00 ABCI Cloud Storage is out of service due to the maintenance.
2019/12/10 (Tue) 11:00 12/13 (Fri) 13:00 Due to the “ABCI Grand Challenge 2019#3”, jobs do not run except those of the participants of the Grand Challenge. The interactive nodes, ABCI User Portal and qa@abci.ai mail service are available to all the users.
2019/12/6 (Fri) 12:30 18:30 266 compute nodes are unavailable due to the ABCI Grand Challenge 2019#3. The start of your jobs may be delayed depending on the number of jobs in the waiting queue.
2019/12/03 (Tue) 00:00 12/4 (Wed) 12:30 522 compute nodes are unavailable due to the ABCI Grand Challenge 2019#3. The start of your jobs may be delayed depending on the number of jobs in the waiting queue.
2019/11/19 (Tue) 00:08 00:09 The internet access of ABCI breaks due to the emergent maintenance of the network equipment.
2019/11/17 (Sun) 20:39 20:43 The internet access of ABCI was broken due to the trouble in the network equipment.
2019/11/17 (Sun) 04:03 04:06 The internet access of ABCI was broken due to the trouble in the network equipment.
2019/11/12 (Tue) 10:47 13:40 Out Of Mmemory caused troubles at es3.
2019/11/1 (Fri) 9:00 12:00 ABCI User Portal is unavailable due to the maintenance.
2019/10/9 (Wed) 12:00 10/10 (Thu) 12:30 522 compute nodes are unavailable due to the practice of ABCI Grand Challenge 2019#2. The start of your jobs may be delayed depending on the number of jobs in the waiting queue.
2019/10/3 12:30 10/4 17:00 ABCI all nodes, ABCI User Portal and qa@abci.ai mail service stop due to the maintenance of ABCI. All waiting jobs and reservations are cancelled.
2019/10/1 11:00 10/3 12:30 ABCI is out of service due to “ABCI Grand Challenge 2019#2”. The users other than the participants of the “ABCI Grand Challenge 2019#2” cannnot log on to the ABCI interactive nodes. Note: ABCI User Portal and qa@abci.ai mail service is available.
2019/9/30 12:00 10/1 11:00 522 compute nodes are unavailable due to the practice of ABCI Grand Challenge 2019#2. The start of your jobs may be delayed depending on the number of jobs in the waiting queue.
2019/9/25 18:00 9/25 18:40 Due to a DNS problem, abci.ai domain, including hosts such as as.abci.ai, could not be looked up.
2019/9/25 12:00 9/27 11:30 522 compute nodes are unavailable due to the practice of the rehearsal of ABCI Grand Challenge 2019#2. The start of your jobs may be delayed depending on the number of jobs in the waiting queue.
2019/8/24 15:00 8/24 21:11 /fs1 and /bb were not responding.
2019/8/15 04:38 8/15 11:08 /fs1 was not responding. Job submission had been restricted since 9:30 due to recovery work.
2019/7/16 20:10 7/16 20:40 /fs3 was not responding.
2019/7/11 15:30 7/11 16:50 /groups2 was not responding.
2019/7/3 10:00 7/3 13:20 The trouble that e-mail to inform you the login URL for ABCI User Portal is blocked is principally resolved.
But the timing it is actually resolved depends on the mail-server on your site. Please try to login later if the mail does not arrive yet. Contact us if it is not resolved even tomorrow(7/4).
2019/6/24 11:00 6/27 12:30 ABCI was out of service due to “ABCI Grand Challenge
2019#1”. The users other than the participants of the
“ABCI Grand Challenge 2019#1” cannnot log on to the
ABCI interactive nodes.
Note: ABCI User Portal and qa@abci.ai mail service
are available.
2019/6/27 12:30 6/28 21:00 ABCI all nodes, ABCI User Portal and qa@abci.ai
mail service stop due to the maintenance of
the water cooling system and ABCI.
(Planned schedule may be changed.)
2019/5/20 13:51 5/20 19:00 ABCI job service (SPOT, ON-DEMAND, RESERVED),
ABCI group storage area and ABCI intaractive nodes(es)
were unavailable due to the problem.
2019/5/10 11:06 5/10 18:00 ABCI job service (SPOT, ON-DEMAND, RESERVED) and
ABCI group storage were unavailable due to the problem.
2019/4/3 13:00 4/5 14:00 Due to maintenance, ABCI all nodes and qa@abci.ai mail service stopped.
ABCI User Portal is available.
2019/4/1 09:00 4/3 13:00 Due to maintenance, ABCI all nodes, qa@abci.ai mail service
and ABCI User Portal stopped.
2019/3/15 17:00 3/18 17:00 Due to maintenance, ABCI User Portal stopped.
2019/1/28 11:00 1/31 13:00 ABCI is out of service due to “ABCI Grand Challenge
#3”. The users other than the participants of the
“ABCI Grand Challenge #3” cannnot log on to the
ABCI interactive nodes.
Note: ABCI User Portal and qa@abci.ai mail service
is available.
2019/1/18 12:00 1/18 13:00 Network maintenance in AIST Kashiwa center.
Up to 10 minutes network down between ABCI and
INTERNET(SINET5) might happen for several times.
During the maintenance, there might be the down of the
sessions of the interactive node and the ABCI user portal,
temporary stop of mails to/from qa@abci.ai and so on.
2018/12/14 15:00 12/17 21:00 ABCI all nodes, ABCI User Portal and qa@abci.ai
mail service stopped, due to the electric power
outage and the maintenance.
2018/11/29 18:00 11/30 11:00 Due to the rack maintenance, the following
calculation nodes stopped.
g0885-0918
2018/11/26 09:00 11/29 18:00 Due to the rack maintenance, the following
calculation nodes stopped.
g0545-1088
2018/11/19 09:00 11/26 09:30 Due to the rack maintenance, the following
calculation nodes stopped.
g0001-0544
2018/11/02 15:00 11/05 15:00 ABCI all nodes, ABCI User Portal and qa@abci.ai
mail service stopped, due to the rack maintenance.
2018/10/26 15:00 10/29 17:00 ABCI all nodes, ABCI User Portal and qa@abci.ai
mail service stopped, due to the electric power
outage and the maintenance.
2018/10/23 09:00 10/26 10:00 ABCI is out of service due to “ABCI Grand Challenge
#2”.The users other than the participants of the
“ABCI Grand Challenge #2” cannnot log on to the
ABCI interactive nodes.
Note: ABCI User Portal and qa@abci.ai mail service
is available.
2018/10/17 09:15 10/17 14:15 ABCI User Portal stopped due to the problem.
2018/9/21 13:00 9/26 17:00 ABCI all nodes, ABCI User Portal and qa@abci.ai
mail service stopped, due to the electric power
outage and the maintenance.
2018/7/27 14:00 8/1 10:00 ABCI User Portal stopped, due to the maintenance.

ABCI Service In Date/Time

(As of 2021/05/10)

Start Date Time Item Name for Service In
2018/7/20 14:00 The ABCI user portal has started accepting ABCI group/account applications.
2018/8/1 13:00 ABCI has started providing services.
2019/12/17 00:00 ABCI has started providing service of new computing resources, M.large and M.small.
2020/1/1 00:00 The ABCI Cloud Storage Service started officially.
2021/5/10 13:00 “ABCI 2.0” ( Compute Node(A) ) has started providing services.