ErrorCode<ERRDOM0002>:The service has crashed and requires a restart.


Problem description



Hi

I have deployed my app with Velocity CTP 3 to my QA environment. At the moment this consists of 2 single-node clusters (I know this doesn't make much sense, but please bear with me, I am working my way to a meaningful configuration). It uses SQL Server for config (each cluster with a different instance). The cluster configuration is:

<?xml version="1.0" encoding="utf-8"?>
<configuration>
    <configSections>
        <section name="dataCache" type="Microsoft.Data.Caching.DataCacheSection, CacheBaseLibrary, Version=1.0.0.0, Culture=neutral, PublicKeyToken=89845dcd8080cc91" />
    </configSections>
    <dataCache cluster="cluster" size="Small">
        <caches>
            <cache type="partitioned" consistency="strong" name="default">
                <policy>
                    <eviction type="lru" />
                    <expiration defaultTTL="10" isExpirable="true" />
                </policy>
            </cache>
            <cache type="partitioned" consistency="strong" name="NetfinanceState">
                <policy>
                    <eviction type="lru" />
                    <expiration defaultTTL="10" isExpirable="true" />
                </policy>
            </cache>
        </caches>
        <hosts>
            <host clusterPort="22234" hostId="351319443" size="512" quorumHost="true"
                name="esweb01" cacheHostName="DistributedCacheService" cachePort="22233" />
            <host clusterPort="22234" hostId="351319443" size="512" quorumHost="true"
                name="esweb02" cacheHostName="DistributedCacheService" cachePort="22233" />
        </hosts>
        <advancedProperties>
            <partitionStoreConnectionSettings providerName="System.Data.SqlClient"
                connectionString="Data Source=xxx;Initial Catalog=Velocity;User Id=xxx;Password=xxx"
                leadHostManagement="false" />
        </advancedProperties>
    </dataCache>
</configuration>

with a different connection string on each machine,

and the cache config is:
CacheName            : Cache
TTL                  : 10
Type                 : Partitioned
Secondaries          : 0
IsExpirable          : True
EvictionType         : LRU
NotificationsEnabled : False

Now my problem is: this used to work until a couple of days ago, when we had a power cut. Since the servers came back up, I get one of these every 7-9 minutes on each node:

Event Type:    Error
Event Source:    DistributedCache
Event Category:    None
Event ID:    0
Date:        11/12/2009
Time:        09:52:13
User:        N/A
Computer:    ESWEB02
Description:
Microsoft.Data.Caching.DataCacheException: ErrorCode<ERRDOM0002>:The service has crashed and requires a restart.
   at Microsoft.Data.Caching.DOMNode.OnNodeFailed(Object sender, EventArgs e)
   at System.Data.Cas.Main.CasNode.OnNodeFailed(Object sender, EventArgs e)
   at System.Data.Fabric.Common.SharedCommunicationObject.Fail(EventArgs arg)
   at System.Data.Fabric.Rendezvous.SiteNode.Fail(EventArgs arg)
   at System.Data.Fabric.Rendezvous.SiteNode.ProcessExternalStoreExpiration(Object state)
   at System.Data.Fabric.Common.Timer.OnTimerElapsed()
   at System.Data.Fabric.Common.TimerQueue.FireTimerCallback(Object state)
   at System.Data.Fabric.Common.IOCompletionPortWorkQueue.Invoke(WaitCallback callback, Object state)
   at System.Data.Fabric.Common.IOCompletionPortWorkQueue.WorkerThreadStart()
   at System.Threading.ExecutionContext.Run(ExecutionContext executionContext, ContextCallback callback, Object state)
   at System.Threading.ThreadHelper.ThreadStart()

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.


They don't happen at the same time on both machines.
I found another question on this forum with a similar exception, and the answer was that it could be related to SQL Server being unavailable. However, I have checked it and I doubt that is the problem in my situation. Could you give me any clues on what else could cause this exception?

Thanks,
Dan

Solution

Hi Dan,

Are these two nodes in the same cluster configuration? According to your description, these hosts are configured with different SQL Server instances.

The host IDs should be different if they are in the same SQL Server.

        <hosts>
            <host clusterPort="22234" hostId="351319443" size="512" quorumHost="true"
                name="esweb01" cacheHostName="DistributedCacheService" cachePort="22233" />
            <host clusterPort="22234" hostId="351319443" size="512" quorumHost="true"
                name="esweb02" cacheHostName="DistributedCacheService" cachePort="22233" />
        </hosts>

If the two hosts are in the same config file, could you please explain what you are trying to achieve? It looks like an overlapping cluster, which is not supported.
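
A quick way to spot a repeated hostId in a cluster config file is sketched below. This is only an illustration and not part of Velocity: the function name `find_duplicate_host_ids` and the trimmed `SAMPLE` config are hypothetical, and the script relies only on Python's standard `xml.etree.ElementTree` module. It assumes the `<host>` elements carry `hostId` and `name` attributes, as in the config posted above.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def find_duplicate_host_ids(config_xml: str) -> dict:
    """Return {hostId: [host names]} for every hostId used by more than one <host>."""
    root = ET.fromstring(config_xml)
    names_by_id = defaultdict(list)
    # Walk every <host> element, wherever it sits in the document.
    for host in root.iter("host"):
        names_by_id[host.get("hostId")].append(host.get("name"))
    # Keep only the IDs that appear on two or more hosts.
    return {hid: names for hid, names in names_by_id.items() if len(names) > 1}


# Hypothetical, trimmed-down version of the config from the question.
SAMPLE = """<configuration>
  <dataCache cluster="cluster" size="Small">
    <hosts>
      <host clusterPort="22234" hostId="351319443" name="esweb01" cachePort="22233" />
      <host clusterPort="22234" hostId="351319443" name="esweb02" cachePort="22233" />
    </hosts>
  </dataCache>
</configuration>"""

if __name__ == "__main__":
    for host_id, names in find_duplicate_host_ids(SAMPLE).items():
        print(f"hostId {host_id} is shared by: {', '.join(names)}")
```

An empty result means every host has its own ID. To check a file on disk rather than a string, read its contents first and pass them to the function.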

Could you please enable server logs (verbose) and mail them to me (sujorwek @ microsoft . com)?

Thanks,
Sudhir

