Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

XT8 Asus ZenWiFi cannot detect mesh nodes if in Access Point mode #542

Open
craigbailey18 opened this issue Feb 20, 2024 · 57 comments
Open
Assignees
Labels
bug Something isn't working Major Affects on of the main functionality of the router

Comments

@craigbailey18
Copy link

Router Model Affected
[ASUS ZenWiFi AX (XT8)]

Firmware Version Affected
[RT-AX95Q_3004_388.6_0-gnuton0_beta1_puresqubi.w] and [RT-AX95Q_3004_388.6_0-gnuton0_beta2_puresqubi.w]

Is this bug present in upstream Merlin releases too?
No, just both RT-AX95Q_3004_388.6 beta 1 and beta 2

Describe the bug
The Mesh nodes are not detected on the main router on this firmware. I have the RT-AX95Q and the firmware updates but the nodes don't show up. I have to downgrade the main router back to the older stable version for it to detect them again. All of my 3 nodes are the same model.

To Reproduce

  1. Update all the nodes to the latest beta firmware
  2. Update the main node (router/controller) to the latest firmware
  3. You will notice the nodes/mesh points are not detected or connected to the main node either wired or wirelessly anymore.

Expected behavior
I usually update it node's firmware first and then the main router. I followed the same process here and the nodes/mesh points never get detected. I even reset all the nodes and started from scratch and the issue remained. I reverted the main router/controller to the stable [3004.388.5_0-gnuton1] and the mesh points were detected again.

Screenshots
RT-AX95Q_3004_388.6 Beta 2 Screenshot
Beta 2 Firmware

RT-AX95Q_3004_388.5 Stable Screenshot
Stable Firmware 1
Stable Firmware 2

@craigbailey18 craigbailey18 added the bug Something isn't working label Feb 20, 2024
@robbbaxley
Copy link

This is the exact same issue I am seeing on my XT8's and I have 5 nodes (primary + 4 secondary mesh).

All of my mesh nodes are wired, are yours wired or Wi-Fi backhaul?

@craigbailey18
Copy link
Author

Yes my 2 mesh nodes are on wired backhaul.

@robbbaxley
Copy link

Hm, that's also just like me. My issue #532 is exactly the same as yours. This weekend I will try the 'fix' @gnuton suggested which is to clean the nvram then try the firmware again. Hopefully that will work, but I have always been successful with a 'normal' dirty flash.

@gnuton
Copy link
Owner

gnuton commented Feb 21, 2024

If you press f12 do you see any error in the developer console of chrome?

@craigbailey18
Copy link
Author

@robbbaxley I will have to probably try that when my wife is asleep. Lol! I have always been able to do dirty flash and it works fine. I just did the downgrade only on the main/controller and it worked flawlessly.

@craigbailey18
Copy link
Author

@gnuton I didn't try that but I will test again later tonight.

@robbbaxley
Copy link

@craigbailey18 , were you able to try the erase NVRAM per @gnuton ? I haven't been able to try it out yet.

@craigbailey18
Copy link
Author

I have been busy so didn't get a chance to do it yet.

@gurple
Copy link

gurple commented Feb 25, 2024

I'm also having odd issues with the XT8s (mostly in WiFi backhaul). All devices are now running 3004.388.6-gnuton1. It seems that now the 'hidden network' that the mesh sets up to manage itself is disabled. They seem to insist on using the weak user-facing 2.4GHz network for their backhaul channel.

Screenshot 2024-02-25 at 18 29 31

Screenshot 2024-02-25 at 18 45 50

@gurple
Copy link

gurple commented Feb 26, 2024

Just to be clear, before reporting this I did perform the full WPS-button-pushing reset. However, I didn't think to check the various radio settings of each XT8 after re-adding them to the network. My bad.

I'm back on the official firmware for these XT8s until I have more time to poke around. Please let me know if there are any specific things I should directly check at that time.

@robbbaxley
Copy link

I tested the new @gnuton pre-release, 3004.388.6_0-gnuton1, this morning on my XT8's and the remote/slave nodes updated OK without any obvious issues (wired backhaul). However, the primary/master node still exhibits the same issue where, when the 3004.388.6 firmware is loaded, all remote/slave nodes become inaccessible (invisible) to the primary/master XT8 node.

First, I did a dirty upgrade, then for my second attempt, I performed a 'clean' NVRAM upgrade, and there was no difference for me. Also, for the first time, downgrading failed until I performed a NVRAM erase. I was then able to upload the old configuration and perform a downgrade to the 3004.388.5_1 Gnuton firmware without any further issues.

So right now, my remote/slave XT8 nodes are running the pre-release, and my primary node is on the previous stable @gnuton 3004.388.5_0-gnuton1.

There's definitely something going on with the XT8's and this firmware version.

@craigbailey18
Copy link
Author

Yes I finally had a chance to try again this morning and I have the same issue on the primary node. I have the pre-release on the nodes and the official on the primary node.

@FlashLight34
Copy link

FlashLight34 commented Feb 26, 2024

Me to second node led flashing blue so im back with .5_gnu firmware(xt8)

I got this with the latest .6

@robbbaxley
Copy link

@gnuton ....do you have any idea what could be going on here? It appears that the XT8's have an issue with this version of the firmware on the primary/master node.

@gnuton
Copy link
Owner

gnuton commented Feb 27, 2024

No idea yet, I will update u as soon as I het what is causing the problem

@craigbailey18
Copy link
Author

Ok thanks @gnuton

@gnuton
Copy link
Owner

gnuton commented Feb 27, 2024

Is this affecting et8 too?

@Dodgydrains
Copy link

Dodgydrains commented Feb 27, 2024

@robbbaxley I have 4 x XT8's and dirty flashed 3004.388.6_0-gnuton1 on all the units over official FW 3.0.0.4.388_24609. Haven't experienced any issues whilst flashing. I always flash the furthest away node first and work my way up to the the primary router.

Is this what you do too?

@robbbaxley
Copy link

@Dodgydrains , yes that's my exact procedure as well. One thing I do notice is that all of us are in "AP" mode and have "Wired Backhaul". Is that your setup too? If it is, I will try what you did and dirty flash over the Asus original firmware. I did try a dirty and clean NVRAM flash...both failed for me but I was coming from the previous @gnuton version.

@Dodgydrains
Copy link

@robbbaxley My primary router is connected to my ISP provided modem. All my XT8's are wireless backhaul. Could that be the difference as you are wired backhaul?

@robbbaxley
Copy link

@Dodgydrains , it's certainly possible and I hope it points @gnuton in the directly of whatever has changed between this version and the previous version to cause this issue.

@craigbailey18
Copy link
Author

Yeah you guys could be right.

@gnuton
Copy link
Owner

gnuton commented Feb 28, 2024

@robbbaxley flashing stock and back makes config just more dirty and unpredictable.
Have you tried to reset to factory the master and one node and try to discover them again? What has happened? This is what has to be tested at first. You should not import any previous setting in this test.

@FlashLight34
Copy link

FlashLight34 commented Feb 28, 2024

Me i have wifi backhaul(2x xt8) and the problem is the second xt8 (node) not connect and flashing blue and the primary dont see him 😭

@robbbaxley
Copy link

@FlashLight34 , it does appear that this issue/bug is related somehow. I am going to attempt @gnuton's latest request to reset all of the devices and set them up as if they were all brand new but using the latest firmware.

@gnuton
Copy link
Owner

gnuton commented Feb 29, 2024

Ran a test yesterday and my xt8 was able to connect to the node via 2,4 GHz, but not via 5ghz. Maybe the issue is there. No idea why this happens

@robbbaxley
Copy link

@gnuton @FlashLight34 @craigbailey18 @Dodgydrains @gurple

Ok, so I did the last thing @gnuton recommended, which is defaulting the firmware of all devices and resetting up the system from scratch. What I did tonight was:

  1. Default all 3 XT8s (1 master and 2 slave nodes) and both XT9s (both slave nodes).
  2. All XT8 slave nodes were already on the latest version of gnuton's 388.6 firmware, Master node was on 388.5. XT9's are running default Asus firmware 3.0.0.4.388.23285
  3. I upgraded the Master node to 388.6
  4. I then re-setup the Master and 1 each of the XT8 and XT9 slave nodes in Router mode.
  5. Everything works as expected. No issue noted so far.

In the coming days I will move the system back to AP mode from Router mode to test to see if that was the problem...unless someone wants to try that first? :D

@craigbailey18
Copy link
Author

@gnuton @FlashLight34 @Dodgydrains @gurple
So I reset everything and set it up from scratch with the official 388_6 firmware in AP mode and it worked. It was a bit time-consuming since I had many custom settings but it finally worked. I am wondering if this has to be done with future firmware or if was it something different in 388_5 to 388_6 that didn't jive with one another.

@Dodgydrains
Copy link

@craigbailey18 good to hear it worked for you. Always best to do factory reset if you encounter issues to rule this out.

@craigbailey18
Copy link
Author

@hulleyrob I am sorry to hear it happened to you as well. I am hoping the new 388_7 addresses the bug.

@gnuton gnuton self-assigned this Mar 12, 2024
@gnuton gnuton added the Major Affects on of the main functionality of the router label Mar 14, 2024
@quangkieu
Copy link

quangkieu commented Mar 19, 2024

For my setup as in picture

Details

image

All mesh routers have direct LAN to a main switch then to main router but they still show using WIFI or LAN to second router.
But when I did speedtest, it was full 700-800mbs. As they are not that close to the main router, they would only connect at slow WIFI speed maybe 200-400mbs, last time when I manually connect to main router WIFI. So, I think they do have LAN backhaul but the graphic just messes up and they think it is in WIFI mode or do not have direct LAN to main router.

@ajayramaswamy
Copy link

Are you sure you are using gnuton firmware on a ZenWiFi XT8 / RT-AX95Q? The picture shows a ET8 and a non gnuton firmware

@craigbailey18
Copy link
Author

craigbailey18 commented Apr 2, 2024

@gnuton @robbbaxley The issue remains on the new 3004.388.6_2-gnuton0_beta1 firmware. I guess I have to stick to 3004.388.5_0-gnuton1 until the issue with the AP mode wired/wireless backhaul is figured out. I think there was a change between firmware 3004.388.5_0-gnuton1 and Unstable: 3004.388.6_0-gnuton0_beta1 that caused this bug.

@hulleyrob
Copy link

Thanks for the warning I hadn't got round to trying it out yet so I can save the hassle and wait for a new release.

@craigbailey18
Copy link
Author

You're welcome @hulleyrob

@gnuton gnuton changed the title Asus ZenWiFi Mesh Bug with 388.6 firmware XT8 Asus ZenWiFi cannot detect mesh nodes if in Access Ppint mode Apr 5, 2024
@gnuton gnuton changed the title XT8 Asus ZenWiFi cannot detect mesh nodes if in Access Ppint mode XT8 Asus ZenWiFi cannot detect mesh nodes if in Access Point mode Apr 5, 2024
@CHF-UK
Copy link

CHF-UK commented Apr 5, 2024

Is this affecting et8 too?

I have 4x ET8's in router mode which are connected wirelessly. Just dirty flashed 3004.388.6_2-gnuton1 and no obvious issues.

@adampk17
Copy link

I have 3 ET8's and I seem to be experiencing this issue on 3004.388.6_2-gnuton1.

@adampk17
Copy link

I have 3 ET8's and I seem to be experiencing this issue on 3004.388.6_2-gnuton1.

I can confirm downgrading to 3004.388.5_0-gnuton1 on my ET8's allowed me to set up AIMesh.

@CPngN
Copy link

CPngN commented Apr 11, 2024 via email

@adampk17
Copy link

Does it work ok with Ethernet link? I gave up on WiFi between the 2 ET8s and bought some TP-Link power line gigabit adapters. Has done wonders to keep upstairs stable.

On Thu, Apr 11, 2024, 3:26 PM adampk17 @.> wrote: I have 3 ET8's and I seem to be experiencing this issue on 3004.388.6_2-gnuton1. I can confirm downgrading to 3004.388.5_0-gnuton1 on my ET8's allowed me to set up AIMesh. — Reply to this email directly, view it on GitHub <#542 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABUJKQ6UOLPR2HRBE6TY5OTY44E3BAVCNFSM6AAAAABDQOEH4OVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANJQGY2TOOBWGQ . You are receiving this because you are subscribed to this thread.Message ID: @.>

I am using a wired backhaul currently and it's working as expected.

@CPngN
Copy link

CPngN commented Apr 12, 2024 via email

@Dodgydrains
Copy link

Dodgydrains commented Apr 12, 2024

For my use case 388.4 was more stable than 388.6. I had issues all day today with calls made over Microsoft Teams. Reverted back to stock official Asus FW 3.0.0.4.388_24621 to resolve this problem. I will wait till the next release. Looks like 388.6 is not working well on my XT8 setup.

@CPngN
Copy link

CPngN commented Apr 25, 2024 via email

@adampk17
Copy link

adampk17 commented May 9, 2024

Has anyone tried 3004.388.7_1 to see if this issue is resolved?

@craigbailey18
Copy link
Author

Not yet resolved. I am not sure it will be anytime soon. You may just have to leave the old firmware on the master node and user the new firmware on the points for now.

@adampk17
Copy link

Not yet resolved. I am not sure it will be anytime soon. You may just have to leave the old firmware on the master node and user the new firmware on the points for now.

Why do you think it won't be fixed any time soon?

@craigbailey18
Copy link
Author

craigbailey18 commented May 12, 2024

Well the bug has been present for close to 3 months that's all I was saying. I appreciate all the hard work that @gnuton does but this seems like a more complex issue.

@gnuton
Copy link
Owner

gnuton commented May 13, 2024

AIMesh is a closed component and it come as prebuilt binary from Asus.
I was not able to spot issues on my side.
As for now my understanding this will be fixed with the next GPL. The other option is to downgrade one part of the firmware to the working version of the prebuilt. But this not always is possible and I have not actually checked this yet. Another option is to use the xt8 prebuild, but I am not even sure if that's a good idea, since mismatching prebuilts can add instability on other side.

Tldr I am waiting for a fix from ASUS.

@craigbailey18
Copy link
Author

I understand @gnuton. That's why I was recommending to just use the older firmware for now.

@gnuton
Copy link
Owner

gnuton commented May 20, 2024

INVESTIGATION LOG

The reason why there are no mesh nodes dispayed comes from http://192.168.50.12/require/modules/amesh.js
get_cfg_clientlist.length is undefined

an hook in web.c is defined for this, this somehow returns null.
The values from this variable comes from the shared memory and they are fetched through shmget function

if you run the httpd server you will see this errors

gnuton@ZenWiFi_XT8-6490:/www# httpd
dict_size 269590
dict_item 4489
shmget failed

Other values can be fetched from the shared memory.
The functions that do not work are

{ "get_cfg_client_info", ej_get_cfg_client_info},
{ "get_cfg_clientlist", ej_get_cfg_clientlist},

This configuration comes from cfg_mnt which is a closed source component
and it's made of the following prebuilts.

asuswrt-merlin.ng/release/src/router/cfg_mnt/prebuild/RT-AX95Q$ ls
$ md5sum *
28bdc60ea1eb4b91924ab264918de60c cfg_client
4d681ca01c01c714df90c7127ac3c20d cfg_reportstatus
0d19ed561bcb5a2ffcbd5376d8a5a59a cfg_server
b6c02e412444d4dcca3165290570f1cf libcfgmnt.so

I have reverted all these files back to previous version.
I am not sure if that would build or works...
It's building right now.. https://github.com/gnuton/asuswrt-merlin.ng/actions/runs/9159344211

UPDATE1
Tested the builds with previous cfg binaries but that didn't help.
The problem should not be in any of these prebuilds

Tried to run cfg_server via SSH which displays several errrors
Got these errors:
gnuton@ZenWiFi_XT8-6490:/www#
json_object_from_file: error opening file /tmp/prelink.json: No such file or directory
json_object_from_file: error opening file /tmp/maclist.json: No such file or directory

reference to these files are only part of the cfg service there are no other binaries or code referncing them
so the problem to me must be there.

Somehow the cfg_server in the prebuild doesn't match the one in the new image created.
Maybe that's fine, but it must be checked.

gnuton@ZenWiFi_XT8-6490:/www# md5sum /usr/sbin/cfg_server
d69cfabadbe793e3744ac3f80a7d0257 /usr/sbin/cfg_server

@hulleyrob
Copy link

Is this expected to be fixed in the latest release?

@gnuton
Copy link
Owner

gnuton commented Jun 11, 2024 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working Major Affects on of the main functionality of the router
Projects
None yet
Development

No branches or pull requests