On July 25, 2016 we experienced an incident that involved the loss of SmartApp states. We realize that this had an impact on a subset of our valued customers and developers, and sincerely apologize to anyone who was impacted by this incident. We value your continued support during our growth to beco…

Tim, thanks for this update. It is much appreciated…,

I agree. The incident report glances over the fact that data was lost and there was no attempt to recover it. What is ST doing in regards to data recovery and why wasn’t an option 3-4 hours in when the problem was recognized?

[image] Mbhforum: 3-4 hours in when the problem was recognized? Because they didn’t offer @ady624 that position yet, to my knowledge:-)

wow thank your for posting this @slagle looking forward to the continued improvement

I am very interested in WHY the engineers that are actually there with the hardware have yet to develop a state recovery process. Especially when a community members had a working smartapp state recovery process built, tested, and published within 12 hours. You guys are the professionals. You hav…

[image] Failures again General Discussion Looks like we had a small spike around the time of this post, but everything stabilized minutes later. I’ll keep an eye on it the rest of the night. I’ll have an engineer look into the spike tomorrow if everything stays normal ton…

Yup, my modes have not been changing for the past 24 hours.

I have an idea, why don’t you create a special region for volunteer? With that, you can always test yr roll out on it first, in real user, advance user, give them some token in return :wink: Many would be happy to volunteer on that special region, anyway without choosing to be in that, we are facin…

That’s a good idea, but also note, instead of rolling out one after another to each shard, why wouldn’t the roll go to one shard for a period of 24 hours, then the next, etc? That being said, I am just guessing here (or second guessing as it may be)… I am more interested in why the report is defic…

I’ll say this. From my experience in code deployment to a large deployment base… You can test and test and load test and user acceptance test and more load test until you are blue in the face. Production and qa/load/uat whatever all are ‘in sync’. Going to prod is always different. Something co…

Incident Report - SmartApp state - July 25, 2016

General Discussion

Aaron (Aaron S) August 8, 2016, 12:21pm 15

We are still experiencing some server hot spots and have a team working on this. Keep reporting failures to support@ so we can trend issues and pull logs for investigation.

Topic		Replies	Views
Loss of State affecting some users / some SmartApps (was Death knell for RM today?) General Discussion core , smarttiles , rulemachine	371	16367	November 4, 2016
Weekly Update from Alex - 06/09/16 Announcements	75	11317	July 3, 2016
Smart apps not working tonight? SmartApps & Automations	35	4443	June 23, 2014
T- 48 hours and nothing but crickets from Smartthings General Discussion support	177	11493	March 21, 2016
SmartThings Downtime and What We're Doing Announcements	185	14739	November 21, 2014

Incident Report - SmartApp state - July 25, 2016

Related topics