Scheduler and Polling quits after some minutes, hours, or days

brbeaird · June 15, 2015, 7:33pm

Tried this already. Aaron S. gave me some suggestions but ended up letting me know that the engineers are aware of the issue and working on it.

pstuart · June 15, 2015, 9:13pm

Yeah, this is the problem. A lot of SmartApps use the schedule or runIn functions to run one time, and inside that handler they schedule the next instance.

This is why most are failing dead and not resuming: if a single run is missed, nothing is left to schedule the next one.
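
For anyone who has not seen this pattern in code, a minimal sketch of the self-rescheduling chain might look like the following. The handler name `doPoll` and the 300-second delay are made up for illustration; `runIn` and `unschedule` are the platform calls. It is only meant to show why the pattern is fragile, not to reproduce any particular SmartApp, and it runs only inside the SmartThings platform.

```groovy
// Sketch of the fragile "chain" pattern, assuming a standard SmartApp lifecycle.
// doPoll and the 300-second delay are hypothetical.
def installed() {
    initialize()
}

def updated() {
    unschedule()   // clear any pending jobs before starting a new chain
    initialize()
}

def initialize() {
    runIn(300, doPoll)   // one-shot: fires once, about 5 minutes from now
}

def doPoll() {
    log.debug "polling devices"
    // ... the real work would go here ...

    // The only thing keeping the chain alive: if this handler never runs
    // (or fails before reaching this line), no further run is scheduled.
    runIn(300, doPoll)
}
```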

ST has a run every X and CRON-based scheduling that should be running all the time. These should survive a platform reset, but there seem to be isolated cases where they just stop until re-initialized.

I am almost tempted to set up OAuth endpoints for all my schedule-based SmartApps, ping them once a day, and call a function to unschedule and re-schedule my actions, which would “reset” CRON-based scheduling.
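
A rough sketch of that idea, assuming the SmartApp has OAuth enabled in the IDE, could look like this. The `/reschedule` path, the `resetSchedules` and `doPoll` names, and the cron expression are all hypothetical; `mappings`, `unschedule`, `schedule`, and `now` are platform calls.

```groovy
// Sketch only: the /reschedule path, the handler names, and the cron
// expression are hypothetical. OAuth must be enabled for the SmartApp
// before the endpoint is reachable.
mappings {
    path("/reschedule") {
        action: [GET: "resetSchedules"]
    }
}

def resetSchedules() {
    unschedule()                        // drop every job this app has scheduled
    schedule("0 0/5 * * * ?", doPoll)   // re-create the cron job (every 5 minutes)
    state.lastReset = now()             // remember when the reset happened
    return [status: "rescheduled", at: state.lastReset]
}

def doPoll() {
    log.debug "scheduled poll ran"
}
```

An external job would then call that endpoint once a day using the installation's OAuth access token, so the reset does not depend on the ST scheduler itself.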

brbeaird · June 15, 2015, 9:19pm

Well, if all you’re wanting is a once per day check, I’d go with copyninja’s method of letting sunrise/sunset kick off that action via subscribe.

subscribe(location, "sunrise", runRefresh)
subscribe(location, "sunset", runRefresh)
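
The thread does not show what `runRefresh` does. One possibility, sketched here under the assumption that the scheduled handler records its last run time in `state`, is to treat it as a watchdog that rebuilds the schedule only when it looks stalled. `isStale`, `doPoll`, `state.lastRun`, and the 15-minute threshold are hypothetical names and values.

```groovy
// Pure helper: true when no run is recorded or the last one is too old.
boolean isStale(Long lastRunMs, long nowMs, long maxGapMs) {
    return lastRunMs == null || (nowMs - lastRunMs) > maxGapMs
}

// Scheduled handler: does the work and records when it ran.
def doPoll() {
    state.lastRun = now()
    log.debug "scheduled poll ran"
}

// Event handler for the sunrise/sunset subscriptions.
def runRefresh(evt) {
    long maxGapMs = 15 * 60 * 1000L   // assumed threshold: three missed 5-minute runs
    if (isStale(state.lastRun as Long, now(), maxGapMs)) {
        log.debug "schedule looks dead, re-creating it"
        unschedule()
        runEvery5Minutes(doPoll)
    }
}
```

Checking for staleness first avoids tearing down a schedule that is still healthy every time the sun rises or sets.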

Otherwise, I’m with you in having an external OAUTH endpoint schedule trigger a check to reset the CRON. I got that working manually as a proof of concept. It’s kinda messy, though, so I’ve been hesitant to automate putting it in.

625alex · June 15, 2015, 9:35pm

Just for the record, runEveryXMinutes fails for me.

tgauchat · June 15, 2015, 9:35pm

If the “system wide, but localized” sunrise/sunset events are occurring reliably, then why isn’t all scheduling reliable?!

brbeaird · June 15, 2015, 9:39pm

Honestly, I haven’t tried this, so I guess I can’t technically confirm that it is reliable. However, most of my stock ST actions based on sunrise/sunset do work more often than not, so I assumed it was being handled slightly differently. You could also subscribe to mode changes, though.

subscribe(location, "mode", runRefresh)

tgauchat · June 15, 2015, 9:42pm

Sure … but… Most folks use a schedule (such as sunset) to change modes; so we’re in an endless loop here.

brbeaird · June 15, 2015, 9:43pm

Fair enough. I leave for work at least once a day, though, which triggers my mode to change without a schedule involved.

btk · June 16, 2015, 1:22am

I’ve done this multiple times. The outcomes have been:

Ignored for months on end
“It’s being looked at by someone else”
It was “an isolated issue”
There were “some platform issues with logging” that day.

I’m sure you can understand why I chuckled when I read your post.

I can send you the log from my monitoring system of when it’s had to restart each of my scheduled apps. Between the 5 different SmartApps I’m monitoring/restarting automatically, there have been 888 restarts since 5/29. Heck, I can have it e-mail support each time one of them crashes if it’d help.

btk · June 16, 2015, 1:23am

I started with this, but they’re crashing several times per day on average for me. I needed something a bit more proactive since a lot of what I depend on the schedules for is logging.

smart · June 16, 2015, 1:53am

With due respect, and as a total ST supporter from the bottom of my heart, how many tickets can members open again and again and again? Please do not take this as sarcasm. We do care. You cannot simply focus on v2 unless you get the current platform stable.

Aaron · June 16, 2015, 3:13am

@brbeaird - sometimes you don’t even need to @mention me… the mere typing of ‘aaron’ will work! (Actually, I have been following this thread because this is really important to us.)

@btk I had a fire drill with one of our problem tracking tags last week because it wasn’t clear if all the scheduled event failure tickets were getting recorded. We are good now. I also saw you had an open ticket that is in the queue for engineer review. I sent a request for someone to review and will try to nudge it again tomorrow. Can you shoot a follow-up note to support@smartthings.com with an updated example of a failed SmartApp, including the following:

Name of SmartApp
Approximate time/date of failure
Expected automation to occur

The more reports of failures that we have, the better Support can help @beckje01 and his team of engineers (who are almost as good looking as Support) isolate the root cause of the issues.

April · June 16, 2015, 3:16am

Ron - Hey… I get it. We keep asking you to submit tickets, and at the surface level, it seems nothing is done. I understand the frustrations. It really helps with the priorities if those tickets go in. We know what to focus on first if there are issues to choose from.

Not everyone is working on v2. Some are. I can attest that people are working in overdrive to fix, build, and repair the v1 experience.

brbeaird · June 16, 2015, 3:21am

Ha! Awesome. Thanks for checking in!

tgauchat · June 16, 2015, 3:23am

*I’ve probably asked for this a few times … well, I know I have.*

If there is some way that you can publish an “Open Issues List”, then we can refer to the Issue Number in our Support Request (if we suspect it is related to the particular “open issue”).

If we don’t know or cannot match an existing “open issue”, then Tech Support could refer us to the “open issue” list that we can continue to track offline.

Would you consider this?

I’ll presume that any “Likes” on this post represent agreement and support of the idea.

April · June 16, 2015, 3:25am

Terry. It’s on my list of things to implement. We know this. We’re trying to integrate an upvote system without requiring members to use another service just for that.

I agree: an open issues list is HUGE. It’s one of my priorities to get it done.

tgauchat · June 16, 2015, 3:31am

Super appreciate that you concur with the high value of this and have made it a priority!

Considering that it is a “high priority item”…

I’d suggest that the slight inconvenience of “yet another service / tool” is worth overlooking.
Frankly, even a shared read-only Google Spreadsheet would be sufficient to see open issues with some sort of reference number. Of course, there are plenty of bug-tracking platforms out there, but … well … don’t let the lack of a hammer stop you from using a brick to bang in this particular nail; if you understand that crazy metaphor.

JDRoberts · June 16, 2015, 3:35am

I reported that the Big Switch had failed, got told my hub has probably just lost connection with the Hue Bridge and to reinstall.

Which will fix the problem temporarily, but doesn’t solve the schedule dying, which I suspect is what happened.

2 weeks earlier I reported that my good morning Hello Home Action had failed to fire. I was told a server had “hiccuped” and to delete and re add it. Which again, solves the problem temporarily but doesn’t address the schedule failure.

3 weeks before that I reported that Sleepytime failed. I was told to uninstall and re add it.

See the pattern? Because the delete/reinstall corrects that particular failure for that particular device, my guess is that it probably never gets added to the scheduler failed stats, even though none of this was custom code.

But the QOS issue remains the same.

April · June 16, 2015, 3:42am

Ahh. Thanks for your wonderful suggestion!

With more manual work, just remember, that’s less time actually developing and implementing. My number one priority and primary focus is to get the platform stable. My second one is developer experience: people worked hard making SmartApp solutions that were sitting in a queue. That’s gone.
Oh, the joys of growing pains, hiring and recruiting. On the upside, I’m seeing about 5-10 hires every week or two.

One more thing - We’re all one big family, and company, though community and support are two separate teams. Although I hear the needs, I’ll also need assistance from the support team. Tyler H and Ryan totally hear me on this. We know we need this. We’re all on board.

tgauchat · June 16, 2015, 3:46am

You’ll get to me eventually… but I’ll be snapped up by then :
