endpoints not self healing

I have been experiencing problems with endpoints when routers go away. I think I just don’t fully understand how the self healing of ZB works. Currently the only way I get the endpoints back is by power cycling them. Here are the configuration files for one of the offending endpoints as well as a router and the coordinator. Is there anything here that would improve or fix the ability to rejoin?

---- endpoint -----
XB24-ZB_28A7.mxi
80
0
251
28A7
0
[A]ID=100
[A]SC=3FFF
[A]SD=7
[A]ZS=1
[A]NJ=FF
[A]JN=1
[A]DH=0
[A]DL=0
[A]NI=END
[A]NH=1E
[A]BH=0
[A]DD=30000
[A]NT=FF
[A]NO=0
[A]CR=3
[A]SE=E8
[A]DE=E8
[A]CI=11
[A]PL=4
[A]PM=1
[A]EE=0
[A]EO=0
[A]BD=3
[A]NB=0
[A]SB=0
[A]RO=3
[A]D7=1
[A]D6=0
[A]CT=64
[A]GT=3E8
[A]CC=2B
[A]SM=4
[A]ST=3E8
[A]SP=AF0
[A]SN=5
[A]SO=6
[A]PO=3
[A]D0=1
[A]D1=2
[A]D2=0
[A]D3=0
[A]D4=0
[A]D5=1
[A]P0=1
[A]P1=0
[A]P2=0
[A]PR=0
[A]LT=0
[A]RP=28
[A]DO=1
[A]IR=FF
[A]IC=0
[A]V+=C00

---- router -----

XB24-ZB_22A7.mxi
80
0
251
22A7
0
[A]ID=100
[A]SC=3FFF
[A]SD=3
[A]ZS=1
[A]NJ=FF
[A]NW=2
[A]JV=1
[A]JN=1
[A]DH=0
[A]DL=0
[A]NI=ROUTER
[A]NH=1E
[A]BH=0
[A]AR=FF
[A]DD=30000
[A]NT=FF
[A]NO=0
[A]CR=3
[A]SE=E8
[A]DE=E8
[A]CI=11
[A]PL=4
[A]PM=1
[A]EE=0
[A]EO=0
[A]BD=3
[A]NB=0
[A]SB=0
[A]RO=3
[A]D7=1
[A]D6=0
[A]CT=64
[A]GT=3E8
[A]CC=2B
[A]SM=0
[A]SN=15
[A]SO=0
[A]SP=AF0
[A]ST=1388
[A]PO=3
[A]D0=5
[A]D1=5
[A]D2=5
[A]D3=5
[A]D4=0
[A]D5=0
[A]P0=1
[A]P1=0
[A]P2=0
[A]PR=FF
[A]LT=0
[A]RP=28
[A]DO=1
[A]IR=3A98
[A]IC=0
[A]V+=C00

---- coordinator -----

XB24-ZB_21A7.mxi
80
0
251
21A7
0
[A]ID=100
[A]SC=3FFF
[A]SD=3
[A]ZS=1
[A]NJ=FF
[A]DH=0
[A]DL=FFFF
[A]NI=
[A]NH=1E
[A]BH=0
[A]AR=FF
[A]DD=30000
[A]NT=3C
[A]NO=0
[A]CR=3
[A]PL=4
[A]PM=1
[A]EE=0
[A]EO=0
[A]BD=7
[A]NB=0
[A]SB=1
[A]D7=0
[A]D6=0
[A]AP=2
[A]AO=0
[A]SP=AF0
[A]SN=10
[A]D0=1
[A]D1=0
[A]D2=0
[A]D3=0
[A]P1=0
[A]P2=0
[A]PR=1FFF
[A]LT=0
[A]RP=28
[A]DO=1
[A]IR=0
[A]IC=0
[A]V+=0
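For anyone comparing profiles like the three above, here is a minimal Python sketch that reads the `[A]XX=value` lines into a dictionary and reports which settings differ between two nodes. The function names `parse_profile` and `diff_profiles` are made up for this illustration, and the parsing only assumes the line shape visible in the pasted files, not any documented file format.

```python
def parse_profile(text):
    """Return {command: value} for lines shaped like '[A]SP=AF0'."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("[A]") or "=" not in line:
            continue  # skip header lines such as the file name and version numbers
        name, _, value = line[3:].partition("=")
        settings[name] = value
    return settings


def diff_profiles(a, b, keys):
    """Return {key: (a_value, b_value)} for the listed keys whose values differ."""
    return {k: (a.get(k), b.get(k)) for k in keys if a.get(k) != b.get(k)}


# A few lines copied from the endpoint and router profiles above.
endpoint = parse_profile("[A]ID=100\n[A]SC=3FFF\n[A]SM=4\n[A]SP=AF0\n[A]SN=5")
router = parse_profile("[A]ID=100\n[A]SC=3FFF\n[A]SM=0\n[A]SP=AF0\n[A]SN=15")

print(diff_profiles(endpoint, router, ["ID", "SC", "SP", "SN"]))
```

With these excerpts, only SN differs between the endpoint and the router; ID, SC and SP match.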

How many times are you trying to send data before you consider the re-join a failure? The standard says that the end device must try 3 times before it goes into a re-join and then a full join.

How many End device children do your other nodes currently have that are in range? If you are at the maximum, then the end device will not re-join the network.

These are standalone XBees; that is, they are just monitoring switches or analog lines. I have let them go 20 hours waiting for them to rejoin.

It is a small network: 4 routers, 4 endpoints and of course 1 coordinator.

How many parent nodes are in range of the end device? What is the Number of children remaining on the other parent nodes?

Sorry, I went back to edit; here is what I added:

It is a very small network, with 4 routers, 4 endpoints and of course 1 coordinator, all closer than the max range of 100 feet.

Are the other parent nodes in range of the end device in question?

Two of the routers are less than 20 feet away, including the one that failed.

My outage consisted of one failed router and two failed endpoints. My first try was to bring back the router; it came up and my coordinator could see it. The endpoints only came back when they were power cycled.

Did you try waking up the other two end devices three times before power cycling them?

No, how would I wake them up? Which pin? RESET?

You’re currently using Cyclic sleep. Try using cyclic sleep with pin wake. Then you can use the Sleep request line to wake the module between cycles.

Actually, two of my endpoints are “cyclic pin wake” and that has not caused them to get reconnected. Even if that had worked, it is not “self healing”; this system is remote and I cannot drive 100 miles to press a button.

Check your sleep settings on your parent nodes. They should match your end devices.

Ok, here are my numbers. My understanding was that the sleep setting on routers and the coordinator needed to be equal to or greater than any endpoint. Do you think that they should be equal?


endpoint

SP=AF0 — 28 seconds
SN=5 — times 5
140 seconds

router

SP=AF0
SN=15
420 seconds

Coordinator

SP=AF0
SN=10
280 seconds
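As a cross-check on the arithmetic above, here is a small Python sketch that computes SP x SN in seconds. It assumes that both values are hexadecimal, as SP=AF0 clearly is, and that SP counts units of 10 ms. The helper name `sleep_seconds` is made up for this illustration.

```python
def sleep_seconds(sp_hex, sn_hex):
    """Total sleep time in seconds, assuming hex values and SP in units of 10 ms."""
    sp = int(sp_hex, 16)  # 0xAF0 = 2800, i.e. 2800 x 10 ms = 28 s
    sn = int(sn_hex, 16)  # number of SP periods
    return sp * sn / 100  # (sp * 10 ms * sn) / 1000 ms per second


for node, sp, sn in [("endpoint", "AF0", "5"),
                     ("router", "AF0", "15"),
                     ("coordinator", "AF0", "10")]:
    print(node, sleep_seconds(sp, sn), "seconds")
```

If SN really is hexadecimal, then SN=15 is 21 periods and SN=10 is 16 periods, so the router and coordinator totals come out larger than the 420 and 280 seconds listed above. The endpoint figure of 140 seconds is the same either way.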

Please note that you are asking for help via a support forum that is user helping user. It is not the route to take if you want Digi to help you resolve the issue. Next, there is not much that users or anyone else can do to help if you do not have access to the system to address the issue. Troubleshooting a network remotely is very difficult at best.

Now, what I would suggest is setting up a small network within your office where all of the nodes are at defaults, apart from the sleep values, which should be set the same on every node, and seeing if that works for you.

Thanks for the help. I’ll try setting all the sleep values the same and see if things change. I’ll try elsewhere to learn how the self healing works.