troubleshooting and repair guide for avalonminer models ... · idle. check if the network is ok or...

29
Page 1 / 29 Troubleshooting and repair guide for AvalonMiner models 721, 741, 761, 821 and 841 Figure 1 Ongoing repair process for an AvalonMiner 821 unit.

Upload: others

Post on 23-Sep-2020

479 views

Category:

Documents


68 download

TRANSCRIPT

Page 1: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 1 / 29

Troubleshooting and repair guide for AvalonMiner

models 721, 741, 761, 821 and 841

Figure 1 Ongoing repair process for an AvalonMiner 821 unit.

Page 2: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 2 / 29

Table of Contents List of Figures and Tables

1. General information, most common tools needed ................................................................................... 4

2. Troubleshooting ........................................................................................................................................ 8

2.1 How to indicate your miner’s physical location ................................................................................ 8

2.2 Information about indicator LED colors and CGMiner API Log error codes .................................... 11

2.3 How to read the abbreviations and values found in the CGMiner API Log ..................................... 13

2.4 How to check the power supply unit health ................................................................................... 16

2.5 Cooling fan troubleshooting ............................................................................................................ 18

2.6 AvalonMiner Controller / Raspberry Pi troubleshooting ................................................................ 19

2.7 AUC3 Converter troubleshooting .................................................................................................... 19

2.8 Control board (MM board) troubleshooting ................................................................................... 19

2.9 PMU board troubleshooting ............................................................................................................ 20

3. Tear down ................................................................................................................................................ 22

3.1 Step 1 – Steel tube case ................................................................................................................... 22

3.2 Step 2 – Cooling fan ......................................................................................................................... 23

3.3 Step 3 – Cooling fan extension cable ............................................................................................... 24

3.4 Step 4 – Control board (MM board) ................................................................................................ 25

3.5 Step 5 – Steel grill ............................................................................................................................ 26

3.6 Step 6 – Frame parts ........................................................................................................................ 26

3.7 Step 7 – PMU board ......................................................................................................................... 27

3.8 Step 8 – Hash board ......................................................................................................................... 27

4. Conclusion ............................................................................................................................................... 29

Page 3: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 3 / 29

List of Figures and Tables

Figure 1 Ongoing repair process for an AvalonMiner 821 unit. ......................................................................................... 1 Figure 3 Phillips head screwdriver. ..................................................................................................................................... 4 Figure 2 Digital multimeter. ................................................................................................................................................ 4 Figure 4 ESD mat and antistatic wrist strap. ....................................................................................................................... 5 Figure 5 Thermal paste. ...................................................................................................................................................... 5 Figure 6 Isopropyl alcohol................................................................................................................................................... 5 Figure 7 AvalonMiner 721 technical specifications. ........................................................................................................... 6 Figure 8 AvalonMiner 741 technical specifications. ........................................................................................................... 6 Figure 9 AvalonMiner 761 technical specifications. ........................................................................................................... 7 Figure 10 AvalonMiner 821 technical specifications. ......................................................................................................... 7 Figure 11 AvalonMiner 841 technical specifications. ......................................................................................................... 7 Figure 12 Normal miner operation before pressing the indicator LED button - the indicator LED color is blue................ 8 Figure 13 After pressing the indicator LED button the miner in question changes the indicator LED color to white. ....... 8 Figure 14 Web configuration page, login screen. ............................................................................................................... 9 Figure 15 Dashboard view. ................................................................................................................................................. 9 Figure 16 System overview. .............................................................................................................................................. 10 Figure 17 CGMiner API Log. .............................................................................................................................................. 10 Figure 18 6-pin PCI-E power connector pinout................................................................................................................. 16 Figure 19 Selecting the right measuring mode for the 12 VDC voltage measurement. ................................................... 16 Figure 20 Measuring the voltage. ..................................................................................................................................... 17 Figure 21 Fan speed 0 rpm at the miner status page. ..................................................................................................... 18 Figure 22 Fan speed 0 rpm at the CGMiner API Log page. .............................................................................................. 18 Figure 23 Both PMU boards are working normally. ......................................................................................................... 20 Figure 24 The second PMU board is not working at all. ................................................................................................... 20 Figure 25 PMU board identification. ................................................................................................................................ 20 Figure 26 Number of works shown in the log. This miner is working normally. .............................................................. 21 Figure 27 Every single ASIC chip temperature shown in the log. ..................................................................................... 21 Figure 28 Hash board identification. ................................................................................................................................ 21 Figure 29 Removing the steel tube case. .......................................................................................................................... 22 Figure 30 Steel tube case removal completed. ................................................................................................................ 22 Figure 31 Removing the cooling fan shroud. .................................................................................................................... 23 Figure 32 Unplugging the 4-pin fan cable connector........................................................................................................ 23 Figure 33 Removing the cooling fan. ................................................................................................................................ 24 Figure 34 Unplugging the 4-pin fan extension cable from the control board. ................................................................. 24 Figure 35 Removing the control board. ............................................................................................................................ 25 Figure 36 Removing the control board. ............................................................................................................................ 25 Figure 37 Removing the hexagon pattern steel grill. ........................................................................................................ 26 Figure 38 Removing the frame parts. ............................................................................................................................... 26 Figure 39 Removing the PMU board. ............................................................................................................................... 27 Figure 40 Two hash boards with one shared large heat sink and two invidual heat sinks............................................... 27 Figure 41 Removing the old thermal paste. ..................................................................................................................... 28 Figure 42 Applying new thermal paste. ............................................................................................................................ 28 Figure 43 Canaan Creative................................................................................................................................................. 29

Table 1 AvalonMiner LED colors. ...................................................................................................................................... 11 Table 2 PMU board LED colors. ........................................................................................................................................ 11 Table 3 AUC3 Converter LED colors. ................................................................................................................................. 11 Table 4 CGMiner API Log error codes. .............................................................................................................................. 12 Table 5 List of codes found in the CGMiner API Log. Part 1. ............................................................................................ 13 Table 6 List of codes found in the CGMiner API Log. Part 2. ............................................................................................ 14 Table 7 List of codes found in the CGMiner API Log. Part 3. ............................................................................................ 15

Page 4: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 4 / 29

1. General information, most common tools needed

This is an official troubleshooting and repair guide for AvalonMiner 7 and AvalonMiner 8 series. All rights to fix or improve this document are reserved to Canaan Creative. In this guide you will see how to disassemble and separate all miner components and it will help you to understand how to replace a faulty miner component like for example a broken cooling fan or faulty PCB board. You will also learn how to do the fault diagnosis to your miner in a case of a miner malfunction and how to find out which miner component is most likely not working correctly. This guide is made using an AvalonMiner 841 unit, but same principles apply with other compatible models mentioned in this guide as well. Most common tools needed:

• Phillips headed screwdriver (it is good to have both PH1 and PH2 size screwdrivers / tips)

• Digital multimeter

• Static electricity protection (ESD mat or antistatic wrist strap for example)

• If you don’t have anything for static electricity protection, please at least ground yourself by touching a grounded object to discharge the static electricity before touching a PCB board

• Good quality thermal paste

• Pure isopropyl alcohol for cleaning the old thermal paste

• Soft cloth or soft paper for cleaning the old thermal paste.

Figure 3 Phillips head screwdriver.

Figure 2 Digital multimeter.

Page 5: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 5 / 29

Figure 4 ESD mat and antistatic wrist strap.

Figure 5 Thermal paste.

Figure 6 Isopropyl alcohol.

Page 6: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 6 / 29

These instructions apply for following AvalonMiner 7 and 8 series models:

• AvalonMiner 721

• AvalonMiner 741

• AvalonMiner 761 (with a built-in power supply unit)

• AvalonMiner 821

• AvalonMiner 841

Figure 7 AvalonMiner 721 technical specifications.

Figure 8 AvalonMiner 741 technical specifications.

Page 7: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 7 / 29

Figure 9 AvalonMiner 761 technical specifications.

Figure 10 AvalonMiner 821 technical specifications.

Figure 11 AvalonMiner 841 technical specifications.

Page 8: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 8 / 29

2. Troubleshooting

In this chapter you will see the common methods for the troubleshooting process and you will also see example cases about the most common issues. The goal is to help understand how to determine which miner component is causing the miner operation malfunction.

2.1 How to indicate your miner’s physical location You can use the indicator LED to help you locate the miner. Just simply press the indicator LED switch button to change the state from ’LED OFF’ to ’LED ON’. You can find the indicator LED switch button location by following these steps:

1. Go the controller web configuration page by using your web browser and login. 2. Click ’Advanced Version’ to access the advanced user features in the OpenWrt firmware. 3. Click ’CGMiner Status’ to see status for all the devices connected to the controller. 4. Choose the AvalonMiner unit which you would like to locate and click ’LED OFF’ button located in

the Indicator field. Now you should see that status change into ’LED ON’. Further observations about this situation are shown in two figures below (Figure 12 and Figure 13).

Video: How to Identify your Avalonminer for Troubleshooting https://www.youtube.com/watch?v=vNMW0QtX5Bk

How to locate and open the CGMiner API Log

Figure 12 Normal miner operation before pressing the indicator LED button - the indicator LED color is blue.

Figure 13 After pressing the indicator LED button the miner in question changes the indicator LED color to white.

Page 9: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 9 / 29

These following instructions will tell you how to locate and open the CGMiner API Log. By reading the log text you are able to see lots of technical information related to the miner performance. Step 1. Use your web browser and access to the controller web configuration page.

Figure 14 Web configuration page, login screen.

Step 2. Click ’Advanced Version’ to access all user features.

Figure 15 Dashboard view.

Page 10: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 10 / 29

Step 3. You are now able to see all the features available. Now you need to click ’CGMiner API Log’ to access the log text.

Step 4. Click ’Debug Switch’ and after that you are ready to read and examine the CGMiner API Log.

Information about indicator LED colors and CGMiner API Log error codes

Figure 16 System overview.

Figure 17 CGMiner API Log.

Page 11: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 11 / 29

2.2 Information about indicator LED colors and CGMiner API Log error codes During the troubleshooting it is best to find and use all the information available, so that the repair is a quick and straight forward process. The miner is able to report about specific issue with different LED colors and error codes. In these following tables you can see a description for each LED color and error code. Please remember to click ’Debug Switch’ button before reading and examining the CGMiner API Log. Video: Avalonminer 741 and 841 - LED Color Guide https://www.youtube.com/watch?v=P59HPZnIk7s

AvalonMiner LED colors

LED status Description

White Power on.

Blue Working normally.

Yellow Idle normally.

Green Detect cooling fan and MCU board.

White blinking or constant white Turn on LED manually.

Red Toohot/Loopback failed/PG failed/Core test failed/Voltage error/Temperature sensor error/No

fan. Table 1 AvalonMiner LED colors.

PMU board LED colors LED status Description

Green Power ok.

Red Power is not good.

No LED light Usually means that the PMU board is faulty. Table 2 PMU board LED colors.

AUC3 Converter LED colors

LED status Description

Blue Initializing or idle normally.

Green Working normally.

Red Communication failed or rejected share (if the LED blinks shortly) Table 3 AUC3 Converter LED colors.

One of the most frequent errors is the socket connect error. Socket connect failed => Connection refused This error can happen because of numerous different things, but here is a list showing few of them.

• Miner is still in the startup process

• Incorrect pool settings

• Wrong OpenWrt firmware version

• Communication error.

Page 12: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 12 / 29

CGMiner API Log error codes

Code Name Type Description What to do

1 CODE_IDLE WARN Idle. Check if the network is ok or if the AUC3 is working normally.

2 CODE_MMCRCFAILED WARN IIC rx irc mismatch was found.

If the CGMiner has restarted ok, you can ignore this warning.

4 CODE_NOFAN FATAL Cooling fan can not be found.

Check the fan connection and condition.

8 CODE_LOCK WARN Reserved. Ask for unlock service.

16 CODE_APIFIFOOVERFLOW WARN API fifo overflow flag. This is just a notice on the API fifo.

32 CODE_RBOVERFLOW WARN Nonce ringbuffer overflow.

If the CGMiner has restarted ok, you can ignore this warning.

64 CODE_TOOHOT FATAL Hash board or control board (MM board) temperature is too

hot.

Check the fan or replace the too hot module.

128 CODE_HOTBEFORE WARN ’Too hot’ was found on the control board.

This is just a notice for the too hot module.

256 CODE_LOOPFAILED FATAL Loopback test failed. The LED will turn to red. Ignore it if the hash rate is not so bad.

512 CODE_CORETESTFAILED WARN Core test failed. Ignore it if the hash rate is not so bad.

1024 CODE_INVALIDPMU FATAL Invalid PMU firmware version or the PMU is

broken.

Replace a new PMU (power management unit) board.

2048 CODE_PGFAILED FATAL The power is not good.

Check the PMU board.

4096 CODE_NTC_ERR FATAL Temperature sensor check failed.

Replace the board which the code was found on.

8192 CODE_VOL_ERR FATAL Voltage input sensor check failed.

Replace the board which the code was found on.

16384 CODE_VCORE_ERR FATAL Voltage output sensor check failed.

Replace the board which the code was found on.

32768 CODE_PMUCRCFAILED WARN This is just a notice. Ignore it please.

65536 CODE_INVALID_PLL_VALUE WARN PLL configration check failed.

Reboot the unit, if it does not help then you need to replace the board in question.

131072 CODE_HUFAILED WARN HU configration check failed.

Reboot the unit, if it is not good then try to replace the HU.

Table 4 CGMiner API Log error codes.

WARN: It designates potentially harmful situations, but you can ignore it.

FATAL: It designates very severe error events that will presumably lead the AvalonMiner to crash.

Page 13: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 13 / 29

2.3 How to read the abbreviations and values found in the CGMiner API Log In this following table you can see the CGMiner API Log thoroughly explained, row by row.

This provides you a lot of help in the troubleshooting process, as you are able to determine possible issues

based on the values seen in the CGMiner API Log. The information is divided into three parts, part 1-3.

Table 5 List of codes found in the CGMiner API Log. Part 1.

Code Description

Miner Log Description.

[Firmware Version] => Avalon Firmware - 20180305

OpenWrt firmware version you are running.

luci: 62d814c The github source code tag.

cgminer: b5b497e The github source code tag.

cgminer-packages: 960e108 The github source code tag.

[STATS] => 0

[ID] => AV80 Miner Model: 8, AUC ID: 0.

[Elapsed] => 134 Unit: s. Time elapsed.

[Calls] => 0

[Wait] => 0.000000

[Max] => 0.000000

[Min] => 99999999.000000

[MM ID1] => Ver[8411802-1e73b80] Raw Cgminer log output.

DNA[0133796fb64c9538] Unique Machine ID. Last four letters are shown in the miner status page. This is tied to Miner Management board (MM ) -- DNA read from FPGA.

Elapsed[108] Unit: s. The elapsed hashing time for the miner.

MW[26335 26109 26186 26247] Hash boards’ works. One work means one block header.

LW[104877] Local Work (Same definition as Cgminer).

MH[0 0 0 0] Hardware errors on each Hash board.

HW[0] Sum of the error number values in the MH field.

Temp[27] Unit: °C. Intake temperature.

TMax[62] Unit: °C. Maximum temperature. This is the temperature value for the hottest ASIC chip.

Fan[5820] Unit: rpm. Fan speed, read from the fan tachometer.

FanR[100%] Unit: %. Control value for the fan speed. Set by the CGMiner.

Vi[1208 1208 1207 1207] Unit: VDC. PMU board input voltage value.

Vo[3806 3812 3787 3812] Unit: VDC. PMU board output voltage value.

PLL0[1 2 42 4427] Core counts distribution on Hash board 0, 1, 2 and 3. Core counts on different PLL . There are 4 PLLs in A3210, the PLL setting can be found in SF[xx xx xx xx]. The number is the [inside chip hash core amount] that running at which PLL. So, for example in this configuration: 1 runs at PLL0, 2 runs at PLL1, 24 cores run at PLL2, the rest all runs at PLL3.

PLL1[3 1 92 4376]

PLL2[0 0 7 4465]

PLL3[0 1 14 4457]

GHSmm[13736.88] Unit: Gh/s. Hash rate calculated through average frequency.

Page 14: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 14 / 29

Table 6 List of codes found in the CGMiner API Log. Part 2.

Code Description

WU[121295.33] WU: The Work Utility defined as the number of diff1 shares work / minute.

Freq[767.94] Unit: Hz. Average frequency. Every core in the chip will work at different frequency, so this is the average frequency.

PG[15] PMU output status (Power Good). If the value is low, please start by checking your power supply health.

Led[0] LED indicator color code.

MW0[5 2 4 2 6 2 5 3 0 3 3 8 7 5 2 8 8 6 3 7 4 9 7 4 9 4]

The work that has been accepted of each chip on the Hash board 0.

MW1[2 5 7 5 4 5 3 4 2 3 5 2 4 4 1 5 1 5 5 2 7 3 5 3 3 2]

The work that has been accepted of each chip on the Hash board 1.

MW2[4 1 7 1 3 6 6 8 7 1 4 6 5 5 4 6 2 4 5 1 5 8 6 6 2 4]

The work that has been accepted of each chip on the Hash board 2.

MW3[4 1 4 6 5 2 3 4 3 4 3 3 0 5 3 3 3 5 3 5 5 4 3 7 4 3]

The work that has been accepted of each chip on the Hash board 3.

TA[0] Total available chips.

ECHU[0 0 0 0] Error code of Hash board 0 – 3. https://github.com/Canaan-Creative/avalon8-docs/wiki/AvalonMiner-error-code

ECMM[0] Error code of Miner Management (MM) board. https://github.com/Canaan-Creative/avalon8-docs/wiki/AvalonMiner-error-code

FAC0[4] Voltage level (Factory information).

OC[0] Over clocking flag (0/1).

SF0[0 0 0 775] Frequency set by CGMiner (it should be 25x, or it will change to 25x). Only use settings that are multiples of 25 Hz.

SF1[0 0 0 775]

SF2[0 0 0 775]

SF3[0 0 0 775]

PMUV[0000 0000] PMU version.

PVT_T0[54 56 57 58 58 59 60 60 61 59 60 60 60 62 57 59 60 61 60 61 61 59 57 57 54 53]

Unit: °C. Chip temperature for the chips on Hash Board 0.

PVT_T1[53 54 55 55 56 56 57 57 58 58 58 58 57 57 58 59 59 58 57 58 57 56 55 55 54 53]

Unit: °C. Chip temperature for the chips on Hash Board 1.

PVT_T2[54 55 55 56 57 58 57 58 59 59 60 59 59 59 60 60 59 59 59 59 58 58 57 56 55 54]

Unit: °C. Chip temperature for the chips on Hash Board 2.

PVT_T3[56 56 57 58 58 59 60 59 59 59 60 59 60 58 59 60 59 59 59 61 59 59 57 57 55 56]

Unit: °C. Chip temperature for the chips on Hash Board 3.

PVT_V0_0[751 751 376 376 1 896 897 743] The voltage of the 1st chip on Hash board 0.

PVT_V0_1[751 751 365 365 1 899 703 744] The voltage of the 2nd chip on Hash board 0.

PVT_V0_2[757 757 366 366 1 893 711 750] The voltage of the 3rd chip on Hash board 0.

PVT_V0_3[749 749 362 362 1 904 703 740] The voltage of the 4th chip on Hash board 0.

-//- -//-

Page 15: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 15 / 29

Table 7 List of codes found in the CGMiner API Log. Part 3.

Code Description

PVT_V1_0[755 755 364 364 1 900 897 746] The voltage of the 1st chip on Hash board 1.

PVT_V1_1[752 752 358 358 1 907 713 743] The voltage of the 2nd chip on Hash board 1.

PVT_V1_2[747 746 367 367 1 896 722 737] The voltage of the 3rd chip on Hash board 1.

PVT_V1_3[751 751 370 370 1 894 705 741] The voltage of the 4th chip on Hash board 1.

-//- -//-

PVT_V2_0[753 753 364 364 1 899 893 743] The voltage of the 1st chip on Hash board 2.

PVT_V2_1[755 755 367 367 1 900 711 746] The voltage of the 2nd chip on Hash board 2.

PVT_V2_2[754 754 357 357 1 898 711 745] The voltage of the 3rd chip on Hash board 2.

PVT_V2_3[752 752 353 354 1 901 712 742] The voltage of the 4th chip on Hash board 2.

-//- -//-

PVT_V3_0[753 753 362 362 1 906 885 743] The voltage of the 1st chip on Hash board 3.

PVT_V3_1[749 749 362 362 1 900 718 741] The voltage of the 2nd chip on Hash board 3.

PVT_V3_2[756 756 354 354 1 897 714 747] The voltage of the 3rd chip on Hash board 3.

PVT_V3_3[749 749 366 366 1 896 713 740] The voltage of the 4th chip on Hash board 3.

-//- -//-

FM[1] Status of current frequency settings (0: init, 1: normal).

CRC[0 0 0 7] CRC: The count of invalid package which are found in the every hash board.

[MM Count] => 1 MM Count: Miner counts under the AUC.

[Smart Speed] => 1 Smart Speed: Smart speed flag (0: Disable, 1: Enable).

[Connecter] => AUC Connecter: AUC / IIC. AUC is AvalonMiner USB Converter and IIC is the Raspberry Pi Inter-Integrated Circuit communication.

[AUC VER] => AUC-20151208 AUC firmware version.

[AUC I2C Speed] => 400000 AUC IIC communication speed.

[AUC I2C XDelay] => 19200 AUC IIC delay for each data transfer.

[AUC Sensor] => 15580 AUC temperature sensor value.

[AUC Temperature] => 27.86 Unit: °C. Convert the sensor raw data to human readable form.

[Connection Overloaded] => false AUC Overload flag. It will be true, if you for example connect seven AvalonMiners on one AUC.

[Voltage Level Offset] => 0 Voltage level offset setting.

[Nonce Mask] => 24 Parameters for the Smart Speed.

[USB Pipe] => 0 Status of the USB communication.

[USB Delay] => r0 0.000000 w0 0.000000

[USB tmo] => 0 0

Video: Avalonminer Troubleshooting Guide : CGMiner API LOG Guide.

https://www.youtube.com/watch?v=-kpcLY7RoTE

Page 16: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 16 / 29

2.4 How to check the power supply unit health If you want to know if the power supply is passing any voltage to the PMU board, you can measure the +12V rail output voltage to find out. According to the ATX specification, the measured voltage should be from 11.4 to 12.6 VDC. Please also take into account the rated input voltage for your AvalonMiner. The most safe area for the input voltage is 12.0 to 12.5 VDC. Video: How to check your AvalonMiner power supply https://www.youtube.com/watch?v=1Ty3D5vuj_I For measuring the voltage you need a multimeter.

You can easily measure the voltage by applying one multimeter test lead to the +12V (yellow) and one

multimeter test lead to the COM (black).

First you need to select the right measuring mode by using the selection knob in the middle of the

multimeter. Make sure that your test leads are connected correctly to the multimeter for voltage

measurement setup.

Figure 18 6-pin PCI-E power connector pinout.

Figure 19 Selecting the right measuring mode for the 12 VDC voltage measurement.

Page 17: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 17 / 29

You can use any safe methods, tools or tape you need to hold the power connector still so that it is easy for

you to hold the test leads safely in position.

Figure 20 Measuring the voltage.

Page 18: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 18 / 29

2.5 Cooling fan troubleshooting If your miner won’t start operating, one of the first things to check if the cooling fan is working. You can check it by doing both visual inspection and looking at the miner status page. If you find a miner with fan reading at 0 rpm or close to it, then it is quite clear that you need to replace that miner’s cooling fan. Please also see if there are any indicator LED colors or log error codes related to a cooling fan issue.

Figure 21 Fan speed 0 rpm at the miner status page.

Figure 22 Fan speed 0 rpm at the CGMiner API Log page.

If the cooling fan failure occurs during the warranty period, you can request a replacement cooling fan from

Canaan Creative.

If your miner is out of warranty, here is a list of few compatible replacement cooling fan models:

• Cooler Master A12038-60BB-4RP-F1

• Delta Electronics QFR1212GHE

• Delta Electronics QFR1212GHEXUM

• Nidec V12E12BS1B5-07

• Sanyo Denki 9GV1212P1J01

• Shengjiu SG121238BS

• Sunon PSD1212PMBXA-F-PWM

Page 19: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 19 / 29

2.6 AvalonMiner Controller / Raspberry Pi troubleshooting If you are not able to access the controller unit at all and you have double-checked that your setup is ok,

then it is possible that your controller unit is faulty and needs to be replaced with a new one.

Please also make sure that the 5 VDC wall wart is in ok condition and that it is able to provide at least 5 VDC

2.5 A power for the controller unit.

Here is the list of compatible controller units:

• AvalonMiner Controller (RPi 3 based device)

• Raspberry Pi 1 Model B

• Raspberry Pi 2 Model B

• Raspberry Pi 3 Model B

2.7 AUC3 Converter troubleshooting If you are able to access the controller unit successfully, but you don’t see any miner devices connected, it

is possible that there is a faulty AUC3 converter or a faulty control board (MM board).

You can look at the LED colors and log error codes and try to figure out the issue that way.

You can try a spare AUC3 converter if you have another one.

You can easily troubleshoot if it is the AUC3 which is faulty, if you have a ten miners or more in your setup.

This means that you have at least two AUC3 converters in your setup. You can swap those two AUC3

converters and try to see if the issue moves to the another group of miners. If it does, you should be able to

know which one is the faulty AUC3 converter.

2.8 Control board (MM board) troubleshooting If you are able to access the controller unit successfully, but you don’t see a miner device connected, it is

possible that there is a faulty AUC3 converter or a faulty control board (MM board).

Another example of possible control board malfunction is that if there is a huge number hash boards down

suddenly.

One good hint is that you have for example five miners connected to one AUC3 converter, and you are only

able to connect four of them, it indicates that there might be issues with one miner control board.

You can look at the LED colors and log error codes and try to figure out the issue that way.

You can try using a spare control board (MM board) for the miner in question if you have another one.

Page 20: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 20 / 29

2.9 PMU board troubleshooting PMU board stands for Power Management Unit board.

If you have one or more hash boards down and without power, it is a good idea to figure out how is the

PMU board condition. You can do this by looking at the CGMiner API Log.

If you any values zero or values having an odd number, it is possible that you have a faulty PMU board.

When you look at the log, please pay attention to Vi and Vo values (voltage in and voltage out).

One PMU board has two power terminals, so there is a total number of four values visible in the log. There

are four digits presenting the measured input voltage and output voltage. For example value 1199 means

11.99 VDC.

Figure 23 Both PMU boards are working normally.

Figure 24 The second PMU board is not working at all.

In this next image you can see how to identify which PMU board is the first one and which is the second.

Hash board

Figure 25 PMU board identification.

Page 21: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 21 / 29

To find out if you have a hash board issue, you can look at two things in the CGMiner API log.

• Works (MW0 – MW3)

• Chip temperatures (PVT_T0 – PVT_T3).

If there is one board showing no Works, then it is clearly a faulty one. Your miner might also have issues with constant rebooting because one of the ASIC chips overheats. In this case you can first try to clean the old thermal paste and then apply new thermal paste. If new thermal paste does no good, you need a new hash board.

In this next image you can see how to identify which ID number is for which hash board.

Figure 26 Number of works shown in the log. This miner is working normally.

Figure 27 Every single ASIC chip temperature shown in the log.

Figure 28 Hash board identification.

Page 22: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 22 / 29

3. Tear down You mostly need only one tool for the tear down process and it is a screwdriver. Make sure you have a good quality Phillips head screwdriver and right sized, to avoid damaging the small bolts. Please also be careful not to use too much force which can potentially damage your screwdriver, bolt or the thread. Please pay attention using the right type of a bolt when assembling it all back together. These instructions show you how to separate each miner component, so that you are able to replace a faulty part in question with a new spare part. Video: Avalonminer 741 Disassembly Guide https://www.youtube.com/watch?v=1W_BO1PimlA

3.1 Step 1 – Steel tube case Remove the steel tube case. There are four small bolts in both sides of the miner, so a total number of eight bolts.

Figure 29 Removing the steel tube case.

Figure 30 Steel tube case removal completed.

Page 23: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 23 / 29

3.2 Step 2 – Cooling fan Remove the cooling fan. There are four screws holding the cooling fan shroud against the frame.

Figure 31 Removing the cooling fan shroud.

Unplug the 4-pin fan cable connector and remove the cooling fan.

Figure 32 Unplugging the 4-pin fan cable connector.

Video: Avalonminer 741 and 841 Fan Replacement Tutorial https://www.youtube.com/watch?v=-ltqGavbzd0

Page 24: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 24 / 29

Figure 33 Removing the cooling fan.

3.3 Step 3 – Cooling fan extension cable Unplug the 4-pin fan extension cable from the control board.

Figure 34 Unplugging the 4-pin fan extension cable from the control board.

Page 25: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 25 / 29

3.4 Step 4 – Control board (MM board) Remove the control board (Miner Management board). There are two small bolts to be removed, after which you can gently detach the control board which is connected to both two PMU boards (Power Management Unit).

Figure 36 Removing the control board.

Figure 35 Removing the control board.

Page 26: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 26 / 29

3.5 Step 5 – Steel grill Remove the hexagon pattern steel grill. There are four small bolts holding it against the frame.

Figure 37 Removing the hexagon pattern steel grill.

3.6 Step 6 – Frame parts Remove both two frame parts. There are six small bolts holding one frame parts against the heat sinks, so a total number of twelve bolts.

Figure 38 Removing the frame parts.

Page 27: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 27 / 29

3.7 Step 7 – PMU board Remove both two power management unit boards (PMU board). There are six bolts per PMU board and four of them connect the power from the PMU board to the hash board, so a total number of twelve bolts.

Figure 39 Removing the PMU board.

3.8 Step 8 – Hash board Remove the hash board. There are three small bolts and six big bolts holding one hash board against the heat sinks.

Figure 40 Two hash boards with one shared large heat sink and two invidual heat sinks.

Page 28: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 28 / 29

Please remove the old thermal paste if needed, if it is hardened or damaged in any different way. You should do this by using pure isopropyl alcohol and a soft cloth or paper. You can also use a cotton swab if needed.

Figure 41 Removing the old thermal paste.

After the surfaces are cleaned, you can apply new thermal paste. Do not use your finger or anything else that causes impurities to mix with the thermal paste when applying the new thermal paste.

Figure 42 Applying new thermal paste.

Now we have gone through how to tear down the miner, how to separate all the miner components and how to replace each invidual miner component. Assembling it all back together is essentially doing this same process backwards.

Page 29: Troubleshooting and repair guide for AvalonMiner models ... · Idle. Check if the network is ok or if the AUC3 is working normally. 2 CODE_MMCRCFAILED . WARN . IIC rx irc mismatch

Page 29 / 29

4. Conclusion

These instructions should be able to help you with most of the issues that might occur, even though AvalonMiner failure rate in field is close to 1%, which is a low number compared to the competitors.

But please remember that if you need help with an miner issue, you can always:

• Submit an email to [email protected]

Any questions, suggestions or comments are welcome by submitting an email to support Canaan Creative website.

Canaan Creative: https://canaanavalonminers.com