This is a discussion on CIC and deadlocks within the pgsql Hackers forums, part of the PostgreSQL category; --> Isn't CREATE INDEX CONCURRENTLY prone deadlock conditions ? I saw one with VACUUM today. But I think it can ...
| |||||||
| FAQ | Members List | Calendar | Search | Today's Posts | Mark Forums Read |
| ||||
| Isn't CREATE INDEX CONCURRENTLY prone deadlock conditions ? I saw one with VACUUM today. But I think it can happen with other commands like VACUUM FULL, CLUSTER, CREATE INDEX CONCURRENTLY and so on. These commands conflict on the ShareUpdateExclusiveLock held by CIC and hence would wait for CIC to release the lock. At the same time, CIC would wait for these transactions to complete. We know that these commands are run in a separate transaction and do not do any index fetches or inserts/updates. So in principle CIC need not wait for these transactions to complete in any of its waits. May be we can skip waits on the transactions that are running one of these commands. Is it something worth doing ? Thanks, Pavan -- EnterpriseDB http://www.enterprisedb.com |
| |||
| On Sat, 2007-03-31 at 17:45 +0530, Pavan Deolasee wrote: > > Isn't CREATE INDEX CONCURRENTLY prone deadlock conditions ? > I saw one with VACUUM today. But I think it can happen with other > commands like VACUUM FULL, CLUSTER, CREATE INDEX > CONCURRENTLY and so on. These commands conflict on the > ShareUpdateExclusiveLock held by CIC and hence would wait for > CIC to release the lock. At the same time, CIC would wait for these > transactions to complete. > > We know that these commands are run in a separate transaction > and do not do any index fetches or inserts/updates. So in principle > CIC need not wait for these transactions to complete in any > of its waits. May be we can skip waits on the transactions that > are running one of these commands. Yes, because I proposed it already. :-) "utility transactions" in - Latest plans for Utilities with HOT -- Simon Riggs EnterpriseDB http://www.enterprisedb.com ---------------------------(end of broadcast)--------------------------- TIP 5: don't forget to increase your free space map settings |
| |||
| "Pavan Deolasee" <pavan.deolasee@gmail.com> writes: > Isn't CREATE INDEX CONCURRENTLY prone deadlock conditions ? Can you give a specific example? The deadlock code will grant locks out-of-order in cases where the alternative is to abort somebody. I think you may be describing a missed opportunity in that logic, more than a reason to add still another fragile assumption for HOT. regards, tom lane ---------------------------(end of broadcast)--------------------------- TIP 5: don't forget to increase your free space map settings |
| |||
| On 3/31/07, Tom Lane <tgl@sss.pgh.pa.us> wrote: > > "Pavan Deolasee" <pavan.deolasee@gmail.com> writes: > > Isn't CREATE INDEX CONCURRENTLY prone deadlock conditions ? > > Can you give a specific example? txn1 - CREATE INDEX CONCURRENTLY (takes ShareUpdateExclusiveLock) txn2 - VACUUM ANALYZE (waits on ShareUpdateExclusiveLock) tnx1 - waits for txn2 to complete in the second phase of CIC deadlock! Lazy VACUUM is safe because we don't include "inVacuum" transactions in the snapshot and hence don't wait for it in CIC. I haven't checked, but VACUUM FULL would also deadlock. > I think you may be describing a missed opportunity in that logic, > more than a reason to add still another fragile assumption for HOT. Not sure what you are referring to. But I shall keep this in mind. Thanks, Pavan -- EnterpriseDB http://www.enterprisedb.com |
| |||
| "Pavan Deolasee" <pavan.deolasee@gmail.com> writes: > On 3/31/07, Tom Lane <tgl@sss.pgh.pa.us> wrote: >> Can you give a specific example? > txn1 - CREATE INDEX CONCURRENTLY (takes ShareUpdateExclusiveLock) > txn2 - VACUUM ANALYZE (waits on ShareUpdateExclusiveLock) > tnx1 - waits for txn2 to complete in the second phase of CIC Oh, it's the cleanup wait you're worried about. > Lazy VACUUM is safe because we don't include "inVacuum" transactions > in the snapshot and hence don't wait for it in CIC. Hmm ... only if it's already set inVacuum true ... there's a window where it has not. I wonder whether we could change CIC so that the "reference snapshot" lists only transactions that are running and have already determined their serializable snapshot (ie, have set proc->xmin). Xacts that haven't yet done that can be ignored because they couldn't possibly see the dead tuples we're worried about, no? Then we could rearrange the order of operations in vacuum_rel so that we lock the target rel before we acquire a snapshot. Then a vacuum waiting for the CIC cannot cause a deadlock. Multi-rel CLUSTER could be changed the same way. I'm not particularly worried about single-rel CLUSTER, only stuff that would be reasonable to launch from background maintenance tasks. [ thinks... ] Actually, it seems risky to omit xids from the reference snapshot; that could perhaps screw up the index insertions. But we could look in the procArray to see if the xid still exists and has set an xmin before we actually wait for it. regards, tom lane ---------------------------(end of broadcast)--------------------------- TIP 1: if posting/reading through Usenet, please send an appropriate subscribe-nomail command to majordomo@postgresql.org so that your message can get through to the mailing list cleanly |
| |||
| On 3/31/07, Tom Lane <tgl@sss.pgh.pa.us> wrote: > > > Hmm ... only if it's already set inVacuum true ... there's a window > where it has not. > > I wonder whether we could change CIC so that the "reference > snapshot" lists only transactions that are running and have already > determined their serializable snapshot (ie, have set proc->xmin). > Xacts that haven't yet done that can be ignored because they couldn't > possibly see the dead tuples we're worried about, no? Yes, it may work. Do we need to take some extra care because proc-xmin is set while holding SHARED lock on proc array ? Then we could rearrange the order of operations in vacuum_rel so > that we lock the target rel before we acquire a snapshot. Then > a vacuum waiting for the CIC cannot cause a deadlock. We may need to do the same in analyze_rel. Thanks, Pavan -- EnterpriseDB http://www.enterprisedb.com |
| |||
| "Pavan Deolasee" <pavan.deolasee@gmail.com> writes: > Yes, it may work. Do we need to take some extra care because > proc-xmin is set while holding SHARED lock on proc array ? Good point. I'm envisioning a procarray.c function along the lines of bool TransactionHasSnapshot(xid) which returns true if the xid is currently listed in PGPROC and has a nonzero xmin. CIC's cleanup wait loop would check this and ignore the xid if it returns false. Your point means that this function would have to take exclusive not shared lock while scanning the procarray, which is kind of annoying, but it seems not fatal since CIC isn't done all that frequently. regards, tom lane ---------------------------(end of broadcast)--------------------------- TIP 2: Don't 'kill -9' the postmaster |
| |||
| Tom Lane wrote: > "Pavan Deolasee" <pavan.deolasee@gmail.com> writes: > > Good point. I'm envisioning a procarray.c function along the > lines of > bool TransactionHasSnapshot(xid) > which returns true if the xid is currently listed in PGPROC > and has a nonzero xmin. CIC's cleanup wait loop would check > this and ignore the xid if it returns false. Your point means > that this function would have to take exclusive not shared lock > while scanning the procarray, which is kind of annoying, but > it seems not fatal since CIC isn't done all that frequently. > Tom, If you haven't finished this yet, would you like me to work on this ? If I do it, I would mostly follow the path you suggested above, unless I run into something else. Thanks, Pavan -- EnterpriseDB http://www.enterprisedb.com ---------------------------(end of broadcast)--------------------------- TIP 9: In versions below 8.0, the planner will ignore your desire to choose an index scan if your joining column's datatypes do not match |
| ||||
| "Pavan Deolasee" <pavan.deolasee@enterprisedb.com> writes: > If you haven't finished this yet, would you like me to work > on this ? If I do it, I would mostly follow the path you > suggested above, unless I run into something else. I'm not intending to work on it. regards, tom lane ---------------------------(end of broadcast)--------------------------- TIP 4: Have you searched our list archives? http://archives.postgresql.org |
| Thread Tools | |
| Display Modes | |
|
|