View Single Post

   
  #2 (permalink)  
Old 04-20-2008, 07:22 AM
Madison Pruet
 
Posts: n/a
Default Re: error cdr define repl

Since this is an error while creating a sparse control message, I suspect
that the problem is being encountered trying to forward the control message
to one of the leaf servers. Are you defining the replicate on the root
node? Is the error occuring on the root node or on one of the non-root
servers?

We may have to isolate the problem just a bit. One way would be to define
the replicate with a minimal number of nodes, and then use the cdr change
replicate command to add participants. That way we could narrow the problem
down a bit.

Since this is 9.2x, you aren't using the smartblob to provide disk storage
for the queues. The control queue has it's own queue table and ALL control
messages must be placed into stable storage. Therefor, it is possible that
the control queue can't write the control message to disk because of some
basic DB issue - possibably the disk storage space is exausted. Don't know
for sure - just guessing.

If that is the case, then you might not see the problem by using 'cdr change
replcate' because you would be generating smaller control messages, and they
might fit into the disk space.

You do realize that 9.2 is a fairly old release... (enough said about
that...) - ;-)


Could you please send me some information about your site? I'm guessing
that you're using a three level replication topology. I try to keep track
of the more interesting customer sites ---- thanks...

M.P.
"cristizaharioiu" <cristizaharioiu@gmail.com> wrote in message
news:1105349651.722628.220830@c13g2000cwb.googlegr oups.com...
> hello all!
>
> I try to define a new replicate using cdr define repl and I receive
> next error in online.log:
>
> 11:14:06 CDR Queuer: rqmQueueTxn (1000/0/0x2886/0x0) failed on CNTRL
> 11:14:06 CDR GC: operation sparse control message queue failed (error
> 0).
>
> the command is:
>
> cdr define repl -C ignore --scope=transaction --immed --ats --ris
> --floatcanon rnom_g_cursvalext \
> "P bd_sco00@g_sco00:informix.curs_valutar_extins" "select * from
> curs_valutar_extins" \
> "R bd_qbank_ghiseu@g_sco40:informix.curs_valutar_extins" "select *
> from curs_valutar_extins" \
> "R bd_qbank_ghiseu@g_sco42:informix.curs_valutar_extins" "select *
> from curs_valutar_extins" \
> "R bd_qbank_ghiseu@g_sco43:informix.curs_valutar_extins" "select *
> from curs_valutar_extins" \
> ...........
> ............
> "R bd_qbank_ghiseu@g_sco122:informix.curs_valutar_extins" "select *
> from curs_valutar_extins"
>
> When I defined replicate shell (/bin/bash) no error returned; the
> error appear only in online.log
>
> We have one root server, 22 nonroot server, 60 leaf server and about
> 700 replicates already defined; all works ok.
> We are running Unixware 7.1 and IDS 9.2
>



Reply With Quote