Transaction Logs so large ....


Transaction Logs so large ....

GLASS mailing list

One of the most surprising things I noticed with my application is the tremendous storage requirement of my transaction logs.

Whenever I test my application (with around 200 clients) I get a new transaction log file every 4 minutes - each 1 GB in size. That would add up to 120 GB of transaction logs per day with 8 hours of working time.


And of course I run full transaction logging .... The database size after doing a backup is 3.4 GB. Normally the clients do not do that much work - but they send lots of events ...

I looked at the inventory and did not find anything strange ... any ideas how to find out why such an amount of space is needed for the transaction logs ....

How do classes like Set and Array, and changes to them, result in transaction log entries ??? I prefer to use Arrays at the moment ...



Re: Transaction Logs so large ....

GLASS mailing list

OK, let me make this a little clearer:

In my test cases I have 200 to 400 clients (written in Python) acting as client simulators. They make their requests via HTTP/REST against GemStone/S, served by 8 responding gem/Zinc tasks (running with different configurations). In total they produce 10-20 events per second against the database. These events change or add data - no deletions at this point.

In terms of speed, GemStone has no problem answering the requests (it would work even with only one answering task).

The current database file is around 64 GB in size.

The size of the uncompressed backup file is around 4 GB.

As I mentioned in my older postings: this system produces a 1 GB transaction log file every 4-5 minutes. Therefore we get around 100 GB of transaction data each day ... wow.

This seems to indicate that the extent files will not be the problem; rather, managing the transaction log files is far more difficult to handle (and I was not aware of this :-))).

Question to the more experienced GemStone/S developers: is this normal (a general pattern)?



Re: Transaction Logs so large ....

GLASS mailing list
I suspect it is the persistence of the Seaside continuations, with all the temps and things that go with them. We have the same problem; our number of users is picking up.

I believe that there is a way to make the continuations transient,
which we have not tried.

We load balance across GemStone sessions, which may influence what you do.


Re: Transaction Logs so large ....

GLASS mailing list

Tranlog files initially contain the transaction sequence not yet represented in the random-access database extent files. Sequential files are faster to update, and their changes can be replayed against extents of different ages (for example after a restore or when cloning). Tranlog files that contain only transactions older than the last checkpoint commit can be archived. A checkpoint commit is the point at which the extents contain the tranlog changes up to that record. You do not need to keep full tranlog files older than the checkpoint commit record in the extent backup that you may wish to restore. GS DBAs write shell scripts to manage these things, knowing which tranlog files are no longer useful to keep.
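From a logged-in session you can ask the stone which tranlogs are still needed; a rough sketch (these Repository selectors are from memory of the GS/64 administration protocol - treat them as assumptions and check them against your version):

    "File id of the oldest tranlog still required for recovery from the latest
     checkpoint; tranlogs with smaller ids are candidates for archiving (keep
     anything you would need to replay on top of the backup you intend to restore)."
    SystemRepository oldestLogFileIdForRecovery.

    "Name of the tranlog currently being written."
    SystemRepository currentLogFile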

If you already know this stuff, then your question would be related to checkpoint commits not happening in a timely manner. How that can happen and how to improve it is a much deeper discussion, but it is usually remedied by changing gems so they do not stay on old views of the database. That can mean more frequent aborts, for example, and starting a transaction only a short time before making the changes to commit. Just one gem staying in transaction for a long time can hold up checkpoint commits. A developer logged into (and in transaction on) a very active database can hold up checkpoints, causing a backlog of transactions not yet applied to the extent files.

Paul Baumann



Re: Transaction Logs so large ....

GLASS mailing list
> If you already know this stuff then your question would be related to
> checkpoint commits not happening in a timely manner. How that can happen and
> how to improve that is a much deeper discussion, but is usually remedied by
> changing gems to not stay with old views of the database. That can mean more
> frequent aborts for example, and starting a transaction a short time before
> making changes to commit. Just one gem staying in-transaction for a long
> time can hold up checkpoint commits. A developer logged into (and
> in-transaction) a very active database can hold up checkpoints causing a
> backlog of transactions not yet applied to extent files.

I understand that if there are gems holding onto old views while other gems commit a lot, more garbage than usual may end up in the extent, and the gems will take longer to commit because they have more work to do determining the write set union etc. I suppose if things start to overflow, one could create bigger and bigger transaction logs because a gem would be committing temporary objects to the repository.

If one installs proper SigAbort handlers in the gems, then the gems should be catching them and aborting - unless there are far more commits than the gems can react to via SigAborts? Is that the case in which this condition may occur?

In our application, we start a number of gems that service Seaside requests. These gems (as well as the other gems we run in the background) all have SigAbort handlers. The load balancing also helps ensure that the Seaside-based application distributes work to idle gems. So in my view, the gems should be cooperating well.

In spite of this, we still see 10GB of tranlogs created in a few hours
under low load. (The db is around 23GB).

I suppose the only way to really analyse what's going on is to run statmonitor and understand the output. From my previous attempts, though, it appears that the Seaside infrastructure is really heavy on transaction logs.

Re: Transaction Logs so large ....

GLASS mailing list

For information purposes: we have no Seaside in this system ... "only" Zinc tasks serving REST requests.

Marten


Re: Transaction Logs so large ....

GLASS mailing list
Ah, OK, sorry about my hammering on Seaside then.


Re: Transaction Logs so large ....

GLASS mailing list

> "more commits than what a gem can react to"

Difficulties like that can happen with continueTransaction, not with abort or beginTransaction. That said, continueTransaction is usually what a gem is coded to try first when it receives a SigAbort.

Write code so that idle gems abort/begin every so often (say, every 15 seconds) without having to be signaled. Aborts are low cost. Do an abort when a request comes in (before processing it). Busy gems that are written well should never be in a transaction long enough to get a SigAbort.
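A minimal sketch of such an idle abort loop, forked once at gem startup (the 15-second figure is just the example above; forking a plain background block is an assumption about how you would run it):

    "While the gem is idle (not in a transaction), abort periodically so it
     never pins an old commit record."
    [ [ true ] whileTrue: [
          (Delay forSeconds: 15) wait.
          System inTransaction ifFalse: [ System abortTransaction ] ] ] fork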

SigAborts normally go only to the gems holding the oldest transaction, and time is normally allowed for them to respond. Other gems may be holding a transaction view started a millisecond after those, and they would not (normally) be signaled. A trick is to write code that proactively signals more gems than normally would be, so that a 45-second allowed response time gains more than a millisecond before the next-oldest gems get signaled. Again though, good code doesn't need to be signaled.

Do you get many commit conflicts? What is your strategy for handling them? Load-balancing efforts can sometimes cause more work resolving conflicts. A common strategy is to have many gems feed work to an RcQueue that a single "manager" gem processes without conflicts. Of course, changes are then not immediately visible to the requesting gem, and more otherwise-transient objects become persistent.
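A rough sketch of that producer/manager split (#WorkQueue is a hypothetical name, and #removeAll on the consumer side is an assumption about the RcQueue protocol - check it against your GemStone version):

    "One-time setup:"
    UserGlobals at: #WorkQueue put: RcQueue new.
    System commitTransaction.

    "Producer gems (request handlers): enqueue work instead of writing shared state."
    System beginTransaction.
    (UserGlobals at: #WorkQueue) add: 'work item payload'.   "placeholder payload"
    System commitTransaction.

    "Single manager gem: drain and process the queue, so only one session writes the shared data."
    System beginTransaction.
    (UserGlobals at: #WorkQueue) removeAll do: [ :each | "process each item here" ].
    System commitTransaction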

Paul Baumann



Re: Transaction Logs so large ....

GLASS mailing list

Concerning transactions: well, all Topaz scripts (which start the Zinc server) run in #manualBegin transaction mode. Outside that mode, handling SigAborts is more difficult.

Other than that: when a request comes in, a beginTransaction is done, the work is done, and a commit or abort follows ... the gem is then out of a transaction again - and should no longer be a candidate for a SigAbort (I'm not sure about that last statement).
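In outline (a sketch of the cycle described above; the actual request handling is elided):

    System transactionMode: #manualBegin.   "done once, in the Topaz startup script"

    "Per incoming request:"
    System beginTransaction.
    [ "look up objects and apply this request's changes here"
      System commitTransaction ]
        on: Error
        do: [ :ex | System abortTransaction. ex return: nil ]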

Conflicts - yes, initially lots of them. We then introduced retries on the server side and now conflicts occur rarely - though the external programmers should always take this into account (... and it's not done).
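A minimal sketch of such a server-side retry (the retry count and where the work gets redone are assumptions):

    | committed |
    committed := false.
    3 timesRepeat: [
        committed ifFalse: [
            System beginTransaction.
            "redo this request's changes here"
            committed := System commitTransaction.   "answers false on conflict"
            committed ifFalse: [ System abortTransaction ] ] ].
    committed ifFalse: [ "give up and report the conflict to the client" ]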

Other conflicts cannot be solved this way and need a different API behaviour (normally implemented with an RcQueue and worker Topaz processes on the server side). The offered API then returns a TASK id and the programmer has to watch for that task ... long-running logic tasks can then be programmed in a conflict-free way on the server side.

For logic conflicts we have a revision counter in the objects.

(We use an API/HTTP/REST/RPC-oriented way of programming the GemStone/S database when interacting with it from C#, Python or JavaScript.)


Re: Transaction Logs so large ....

GLASS mailing list

Manual begin mode can still refer to an old view of the data; it is not the same as transactionless mode, where the view is allowed to adjust automatically to show newer changes. You would still benefit from periodic aborts to avoid the signals to abort. Transactionless mode does not get SigAborts.

Paul Baumann



Re: Transaction Logs so large ....

GLASS mailing list

That's very interesting.

This means that I should introduce my own subclass of ZnServer whose #serveConnectionOn: (or #noteAcceptWaitTimedOut) is overridden to make an empty transaction (with #beginTransaction and #abortTransaction) on a periodic basis (every 15 seconds?).
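Roughly along these lines - a sketch only, assuming #noteAcceptWaitTimedOut is the hook the Zn accept loop calls whenever its accept wait times out (so the timeout itself already provides the periodicity):

    noteAcceptWaitTimedOut
        "In a ZnServer subclass: refresh this idle gem's view of the repository so it
         never holds up checkpoint commits. Outside a transaction (#manualBegin mode)
         the abort just updates the view; nothing is lost."
        System abortTransaction.
        ^ super noteAcceptWaitTimedOut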

Marten


Re: Transaction Logs so large ....

GLASS mailing list


On 10/24/2016 02:20 AM, Otto Behrens via Glass wrote:
> I suspect it is the persistence of the seaside continuations, with all
> the temps and things that go with it. We have the same problem; our
> number of users are picking up.
>
> I believe that there is a way to make the continuations transient,
> which we have not tried.
I think that if you make continuations non-persistent, then you will need to make sure that you have session affinity [1] ... this could cost you in needing enough gems to handle the number of concurrent "long term" sessions you have ... I did experiments many years ago, and I think that if you have a known number of users/browser sessions, this approach could work ... let me know if there is interest here and I can blow the dust off of that work and give you some pointers/help.

I've also done some work with notTranlogged session data ... back when I was doing scaling work trying to get to 10,000 requests a second (I hit 7,000) I noticed the growth in tranlogs, and a notTranlogged feature was subsequently added to GemStone ... until now I hadn't heard about users really needing this feature -- even now it is in a stress-testing context -- but a year ago I took some time to do some experiments with notTranlogged session state for Seaside, and the work is on GitHub [2]. I was able to get my notTranlogged experiment to function, but I never did any comparison tests to find out how much would actually be saved ...

If there is interest in bringing this up to date, perhaps we could work together as a group to start doing experiments in this area and see if we can make a dent in the size of the large tranlogs.

Dale

[1]
https://gemstonesoup.wordpress.com/2008/08/19/1-session-per-vm-another-scaling-alternative/
[2] https://github.com/dalehenrich/Seaside/tree/notTranlogged


Re: Transaction Logs so large ....

GLASS mailing list


On 10/24/2016 04:39 AM, Otto Behrens via Glass wrote:

>> If you already know this stuff then your question would be related to
>> checkpoint commits not happening in a timely manner. How that can happen and
>> how to improve that is a much deeper discussion, but is usually remedied by
>> changing gems to not stay with old views of the database. That can mean more
>> frequent aborts for example, and starting a transaction a short time before
>> making changes to commit. Just one gem staying in-transaction for a long
>> time can hold up checkpoint commits. A developer logged into (and
>> in-transaction) a very active database can hold up checkpoints causing a
>> backlog of transactions not yet applied to extent files.
> I understand that if there are gems holding onto old views with other
> gems committing a lot, it may happen that more garbage than usual will
> end up in the extent and the gems will take longer to commit because
> they have more work to determine the write set union etc. I suppose if
> things start to overflow one could create bigger and bigger
> transaction logs because a gem would be committing temporary objects
> to the repository.
>
> If one install proper SigAbort handlers for gems then the gems should
> be catching them and aborting, unless there are way more commits than
> what the gems can react to via sig aborts? Is it in this case that the
> condition may occur?
>
> In our application, we start a number of gems that service seaside
> requests. These gems (as well as the other gems we run in the
> background), all have sig abort handlers. The load balancing also
> helps to ensure that the seaside based application distributes work to
> idle gems. So in my view, gems should be cooperating well.
>
> In spite of this, we still see 10GB of tranlogs created in a few hours
> under low load. (The db is around 23GB).
>
> I suppose the only way to really analyse what's going on is to run
> statmonitor and understand the output. From my previous attempts
> though, it appears as if the Seaside infrastructure is really heavy on
> transaction logs.

Otto, you are absolutely right ... one of the things that makes GemStone fast is that the session state (mostly continuations) is not held in transient memory but written to disk --- with a cost in tranlog size --- and then MFC is used to clean up the mess ...

There is a possibility that the large tranlogs are being caused by updates to large sequenceable collections (like OrderedCollection and SortedCollection) and others ... we have some basic tools that we ship with the product that can be used to analyze the content of tranlogs and, with a bit of work, to tell us what is in them (continuations will show up as a lot of GsProcess instances, for example), so we can start to get a handle on the problem ..

Again, as a group effort I can help with instructions on doing the analysis and then we can see what can be done ... we have done some work on reducing the size of tranlogs for some types of data, but not all, so getting information from you guys would help us understand where to put our emphasis --- whether on adding additional features to GemStone, fixing bugs, or working on notTranlogged session state for Seaside.

Dale

Re: Transaction Logs so large ....

GLASS mailing list


On 10/24/2016 04:22 AM, Paul Baumann via Glass wrote:

>
> Tranlog files at first have transaction sequence not yet represented
> in the random-access database extent files. Sequential files are
> faster to update and changes can be replayed to extents of different
> ages (like from restore or cloning). Tranlog files that only contain
> transactions older than the last Checkpoint Commit can be archived. A
> checkpoint commit is the point where extents now contain the tranlog
> changes. You do not need to keep full tranlog files older than the
> checkpoint commit record in the extents backup that you may wish
> restore. GS DBAs write shell scripts to manage these things, knowing
> what tranlog files are no longer useful to keep.
>
> If you already know this stuff then your question would be related to
> checkpoint commits not happening in a timely manner. How that can
> happen and how to improve that is a much deeper discussion, but is
> usually remedied by changing gems to not stay with old views of the
> database. That can mean more frequent aborts for example, and starting
> a transaction a short time before making changes to commit. Just one
> gem staying in-transaction for a long time can hold up checkpoint
> commits. A developer logged into (and in-transaction) a very active
> database can hold up checkpoints causing a backlog of transactions not
> yet applied to extent files.
>
Paul, a CR backlog does not cause tranlogs to grow ... a CR backlog shows up as extent growth, not tranlog file growth...

Dale

Re: Transaction Logs so large ....

GLASS mailing list

I recall using several approaches; I'm not sure which I stayed with. It was nine years ago, and I've also addressed this for GBS clients. A server subclass was one approach, because there was no better hook. I recall there being a facility that could be registered for transaction changes. I recall noting an expiration time some seconds from the current time. I recall Norm had added (or perhaps optimized) a way to determine view age at my request. I recall the subclass being needed to add an instvar and because #continueTransaction needed special handling (it did not notify like the others). Those were ways to know view age in a general way, hopefully obsolete for more recent releases. I recall a "busy while" global being updated and checked to defer aborts by a background process (which might have been GBS-specific code). Sorry, I'd have to see the code to help with specifics. What you've said sounds familiar though.

Active management of view age relieved serious performance risks (to company operations) that had previously existed when relying on reactive SigAborts. Avoid long-held database views. We went from serious backlog alarms triggering several times a day to eventually none at all, and quick recovery from any commit record backlog.

Paul Baumann


On Oct 24, 2016 10:43 AM, "Marten Feldtmann via Glass" <[hidden email]> wrote:

That's very interesting.

This means, that I should introduce my own subclass of ZnServer and its #serveConnectionOn: (or the #noteAcceptWaitTimedOut) should be rewritten to make an empty transaction (with #beginTransaction and #abortTransaction) on a periodical base (15 seconds ?).

Marten

Paul Baumann <[hidden email]> hat am 24. Oktober 2016 um 15:41 geschrieben:

Manual begin mode can still refer to an old view of data, it is not the same as transactionless mode where the view is allowed to automatically adjust to show newer changes. You still would benefit from periodic aborts to avoid the signals to abort. Transactionless mode does not get SigAbort.

Paul Baumann


On Oct 24, 2016 9:13 AM, "Marten Feldtmann" <[hidden email]> wrote:

Concerning transactions: Well, all Topaz scripts (starting the Zinc server) are started in #manualBegin transactionMode. If not in that state the handling of SigAbort handlers is more difficult.

Other than that: after an incoming request a BeginTransaction is done, the work is done and a commit or abort is then done ... then the gem is again out of an transaction - and then it should be no candidate for SigAbort handler (I'm not sure of that last statement).

Conflicts - yes initially lots of. After that we introduce retries on the server side and now the conflicts are not very often - though the external programmers should always consider this ( ... and its not done).

Other conflicts can not be solved in this way and need a different API behaviour (normally implemented with RcQueue and worker topaz processes on the server side). The offered API then returns a TASK id and the programmer has to watch for its task ... long-term logic tasks can be programmed in a conflict-free way then on the server side.

For logic conflicts we have a revision counter in the objects.

(We use an API-HTTP-REST-RPC oriented way of programming the Gemstone/S database when interacting via C#, Python or Javascript with the database).

Paul Baumann <[hidden email]> hat am 24. Oktober 2016 um 14:37 geschrieben:

> "more commits than what a gem can react to"

Difficulties like that can happen with continueTransaction, not aborts nor beginTransaction. That said, continueTransaction is usually what a gem is coded to try first when it receives a SigAbort.

Write code so that idle gems abort/begin every so often (like 15 seconds) without having to be signaled. Aborts are low cost. Do an abort when the request comes in (before processing it). Busy gems that are well done should never be in transaction long enough to get a SigAbort.

SigAborts only (normally) go to the gems holding the oldest transaction and time is normally allowed for them to respond. Other gems may be holding a transaction view started a millisecond after those and they would not (normally) be signaled. A trick is to write code to proactively signal more gems than normally would be so that a 45 second allowed response time is able to gain more than a millisecond before the next oldest get signaled. Again though, good code doesn't need to be signaled.

Do you get many commit conflicts? What is your strategy for handling them? Load balancing efforts can sometimes cause more work resolving conflicts. A common strategy is to have many gems feed work to an RCQueue that a single "manager" gem processes without conflicts. Of course then changes are not immediately visible to the requesting gem and more otherwise only-transient objects become persistent.

Paul Baumann


On Oct 24, 2016 7:39 AM, "Otto Behrens" <[hidden email]> wrote:
> If you already know this stuff then your question would be related to
> checkpoint commits not happening in a timely manner. How that can happen and
> how to improve that is a much deeper discussion, but is usually remedied by
> changing gems to not stay with old views of the database. That can mean more
> frequent aborts for example, and starting a transaction a short time before
> making changes to commit. Just one gem staying in-transaction for a long
> time can hold up checkpoint commits. A developer logged into (and
> in-transaction) a very active database can hold up checkpoints causing a
> backlog of transactions not yet applied to extent files.

I understand that if there are gems holding onto old views with other
gems committing a lot, it may happen that more garbage than usual will
end up in the extent and the gems will take longer to commit because
they have more work to determine the write set union etc. I suppose if
things start to overflow one could create bigger and bigger
transaction logs because a gem would be committing temporary objects
to the repository.

If one installs proper SigAbort handlers in the gems, then the gems should
catch the signals and abort - unless there are far more commits than the
gems can react to via SigAborts? Is that the situation in which this
condition may occur?
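For reference, a GLASS-style handler amounts to a sketch like this (assuming GemStone/S 64 3.x, where the signal is delivered as a TransactionBacklog exception; older versions use a different mechanism):

    System enableSignaledAbortError.
    TransactionBacklog addDefaultHandler: [:ex |
        System abortTransaction.     "give up the old view promptly"
        ex return].

As Dale notes further down, such a handler protects against a commit-record backlog (extent growth), not against tranlog growth.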

In our application, we start a number of gems that service Seaside
requests. These gems (as well as the other gems we run in the background)
all have SigAbort handlers. The load balancing also helps to ensure that
the Seaside-based application distributes work to idle gems. So in my view,
the gems should be cooperating well.

In spite of this, we still see 10GB of tranlogs created in a few hours
under low load. (The db is around 23GB).

I suppose the only way to really analyse what's going on is to run
statmonitor and understand the output. From my previous attempts
though, it appears as if the Seaside infrastructure is really heavy on
transaction logs.

_______________________________________________
Glass mailing list
[hidden email]
http://lists.gemtalksystems.com/mailman/listinfo/glass



Re: Transaction Logs so large ....

GLASS mailing list
In reply to this post by GLASS mailing list

Of course Dale. I had brought the discussion to other areas. It had sounded like more tranlog files were kept than necessary. I had not touched at all on how to reduce how quickly they grow.


On Oct 24, 2016 1:54 PM, "Dale Henrichs via Glass" <[hidden email]> wrote:


On 10/24/2016 04:22 AM, Paul Baumann via Glass wrote:

Tranlog files at first contain transaction sequences not yet represented in the random-access database extent files. Sequential files are faster to update, and changes can be replayed against extents of different ages (for example from a restore or a clone). Tranlog files that only contain transactions older than the last checkpoint commit can be archived. A checkpoint commit is the point at which the extents contain the tranlog changes. You do not need to keep full tranlog files older than the checkpoint commit record in the extent backup that you may wish to restore. GS DBAs write shell scripts to manage these things, knowing which tranlog files are no longer useful to keep.

If you already know this stuff then your question would be related to checkpoint commits not happening in a timely manner. How that can happen and how to improve that is a much deeper discussion, but is usually remedied by changing gems to not stay with old views of the database. That can mean more frequent aborts for example, and starting a transaction a short time before making changes to commit. Just one gem staying in-transaction for a long time can hold up checkpoint commits. A developer logged into (and in-transaction) a very active database can hold up checkpoints causing a backlog of transactions not yet applied to extent files.

Paul, a CR backlog does not cause tranlogs to grow ... a CR backlog shows up as extent growth, not tranlog file growth...

Dale
_______________________________________________
Glass mailing list
[hidden email]
http://lists.gemtalksystems.com/mailman/listinfo/glass


Re: Transaction Logs so large ....

GLASS mailing list
In reply to this post by GLASS mailing list


On 10/24/2016 06:12 AM, Marten Feldtmann via Glass wrote:
>
> Concerning transactions: Well, all Topaz scripts (starting the Zinc
> server) are started in #manualBegin transactionMode. If not in that
> state the handling of SigAbort handlers is more difficult.
>
SigAbort handling has nothing to do with tranlog growth, so don't mess
with this ...
>
> Other than that: after an incoming request a BeginTransaction is done,
> the work is done and a commit or abort is then done ... then the gem
> is again out of an transaction - and then it should be no candidate
> for SigAbort handler (I'm not sure of that last statement).
>
Again ... SigAbort handling does not affect the amount of data that is
dumped to a tranlog.

>
> Conflicts - yes initially lots of. After that we introduce retries on
> the server side and now the conflicts are not very often - though the
> external programmers should always consider this ( ... and its not done).
>
> Other conflicts can not be solved in this way and need a different API
> behaviour (normally implemented with RcQueue and worker topaz
> processes on the server side). The offered API then returns a TASK id
> and the programmer has to watch for its task ... long-term logic tasks
> can be programmed in a conflict-free way then on the server side.
>
> For logic conflicts we have a revision counter in the objects.
>
> (We use an API-HTTP-REST-RPC oriented way of programming the
> Gemstone/S database when interacting via C#, Python or Javascript with
> the database).
>

Since you are not using Seaside we cannot blame continuations, so we
have to look for other sources of tranlog growth ...

Do you have large sorted collections or other SequenceableCollections
where you insert data in the middle of the collection? If you do, we write
tranlog entries for the shift of the data in those kinds of collections ...
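A toy illustration of the difference, assuming a persistent SortedCollection (here called Timeline, ordered by timestamp) and a persistent OrderedCollection (EventLog); all names are placeholders:

    System beginTransaction.
    Timeline add: anOutOfOrderEvent.    "sorts in near the front: the elements behind it shift and are logged"
    System commitTransaction.

    System beginTransaction.
    EventLog addLast: aNewEvent.        "only the tail changes, so far less tranlog data"
    System commitTransaction.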

If you use indexes, it is possible for the RcIndexDictionary collision
buckets to get large, and they will behave like a SequenceableCollection
when it comes to inserts ...

I think the best bet here is to first identify what's being dumped into
your tranlogs and then go from there ... we've recently had another
customer with tranlog growth issues, and the engineer who handled that
is out of the office ... This would actually be a good case for
submitting an HR and then working directly with tech support on this
issue, so they can help you identify the source of the tranlog growth and
work out strategies for reducing or otherwise addressing the problem.

Dale
_______________________________________________
Glass mailing list
[hidden email]
http://lists.gemtalksystems.com/mailman/listinfo/glass

Re: Transaction Logs so large ....

GLASS mailing list
In reply to this post by GLASS mailing list

I thought that Marten was asking about large transaction logs, and I didn't see that he had trouble with large extents, as he is using a SigAbort handler ...

Dale


On 10/24/2016 11:06 AM, Paul Baumann wrote:

Of course Dale. I had brought the discussion to other areas. It had sounded like more tranlog files were kept than necessary. I had not touched at all on how to reduce how quickly they grow.


On Oct 24, 2016 1:54 PM, "Dale Henrichs via Glass" <[hidden email]> wrote:


On 10/24/2016 04:22 AM, Paul Baumann via Glass wrote:

Tranlog files at first contain transaction sequences not yet represented in the random-access database extent files. Sequential files are faster to update, and changes can be replayed against extents of different ages (for example from a restore or a clone). Tranlog files that only contain transactions older than the last checkpoint commit can be archived. A checkpoint commit is the point at which the extents contain the tranlog changes. You do not need to keep full tranlog files older than the checkpoint commit record in the extent backup that you may wish to restore. GS DBAs write shell scripts to manage these things, knowing which tranlog files are no longer useful to keep.

If you already know this stuff then your question would be related to checkpoint commits not happening in a timely manner. How that can happen and how to improve that is a much deeper discussion, but is usually remedied by changing gems to not stay with old views of the database. That can mean more frequent aborts for example, and starting a transaction a short time before making changes to commit. Just one gem staying in-transaction for a long time can hold up checkpoint commits. A developer logged into (and in-transaction) a very active database can hold up checkpoints causing a backlog of transactions not yet applied to extent files.

Paul, a CR backlog does not cause tranlogs to grow ... a CR backlog shows up as extent growth, not tranlog file growth...

Dale
_______________________________________________
Glass mailing list
[hidden email]
http://lists.gemtalksystems.com/mailman/listinfo/glass



Re: Transaction Logs so large ....

GLASS mailing list
In reply to this post by GLASS mailing list
Marten,

To be clear.

1GB in 5 minutes is 60Mb of tranlog data per second. At 20 events/second
you are producing 3Mb per event ... is that reasonable given the
application data you are saving per event from an application perspective?

If you still think that that is excessive, then I think submitting an HR
is called for. Tech support can walk you through the tranlog analysis
steps and help determine whether what you are experiencing is a problem that
has already been fixed (the other customer with tranlog growth problems) or
whether you are experiencing "normal growth" --- we are aware of additional
optimizations that can be made for tranlogs, but those optimizations are
not yet available; there is also the possibility of switching to another
data structure, or you may even be persisting more data than you expect
-- without analysis we can only guess.

I'm not sure that you need to change the SigAbort handler that we talked
about at ESUG. I am under the impression that you are not concerned
about extent growth, and if so, then that SigAbort handler is doing its
job.

Dale


On 10/24/2016 11:12 AM, Dale Henrichs wrote:

>
>
> On 10/24/2016 06:12 AM, Marten Feldtmann via Glass wrote:
>>
>> Concerning transactions: Well, all Topaz scripts (starting the Zinc
>> server) are started in #manualBegin transactionMode. If not in that
>> state the handling of SigAbort handlers is more difficult.
>>
> SigAbort handling has nothing to do with tranlog growth, so don't mess
> with this ...
>>
>> Other than that: after an incoming request a BeginTransaction is
>> done, the work is done and a commit or abort is then done ... then
>> the gem is again out of an transaction - and then it should be no
>> candidate for SigAbort handler (I'm not sure of that last statement).
>>
> Again ... Sigabort handling does not affect the amount of data that is
> dumped to a tranlog
>>
>> Conflicts - yes initially lots of. After that we introduce retries on
>> the server side and now the conflicts are not very often - though the
>> external programmers should always consider this ( ... and its not
>> done).
>>
>> Other conflicts can not be solved in this way and need a different
>> API behaviour (normally implemented with RcQueue and worker topaz
>> processes on the server side). The offered API then returns a TASK id
>> and the programmer has to watch for its task ... long-term logic
>> tasks can be programmed in a conflict-free way then on the server side.
>>
>> For logic conflicts we have a revision counter in the objects.
>>
>> (We use an API-HTTP-REST-RPC oriented way of programming the
>> Gemstone/S database when interacting via C#, Python or Javascript
>> with the database).
>>
>
> Since you are not using Seaside we cannot blame continuations, so we
> have to look for other sources of tranlog growth ...
>
> Do you have large sorted collections or other SequenceableCollection
> collections where you insert data in the middle of the collection? If
> you do we write tranlog entries for the shift of the data in these
> types of collections ...
>
> If you use indexes it is possible for the  RcIndexDictionary collision
> buckets to get large and they will behave like a
> SequenceableCollection collection when it comes to inserts ...
>
> I think the best bet here is to first identify what's being dumped
> into your tranlogs and then go from there ... we've recently had
> another customer with tranlog growth issues and the engineer that
> handled that is out of the office ... This would actually be a good
> case for submitting an HR and then work directly on this issue with
> tech support to help you identify the source of the tranlog growth and
> work out strategies for reducing or otherwise addressing the problem.
>
> Dale

_______________________________________________
Glass mailing list
[hidden email]
http://lists.gemtalksystems.com/mailman/listinfo/glass

Re: Transaction Logs so large ....

GLASS mailing list
Well, Richard pointed out that I did the math wrong in several steps ---
it's good to show your work (part of being clear :), so that others can
point out your mistakes.

The math should have been:

1GB in 5 minutes is about 3.4MB of tranlog data per second, or roughly 170KB per event

Which may or may not be excessive depending upon what you are doing in
your application...
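A quick workspace check of those numbers (assuming 1GB = 2^30 bytes and 20 events per second):

    | perSecond perEvent |
    perSecond := (1024 * 1024 * 1024) / (5 * 60).   "about 3.58 million bytes/s, i.e. ~3.4 MB/s"
    perEvent := perSecond / 20.                      "about 179,000 bytes, i.e. roughly 170-175 KB per event"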

Dale

On 10/24/2016 11:34 AM, Dale Henrichs wrote:

> Marten,
>
> To be clear.
>
> 1GB in 5 minutes is 60Mb of tranlog data per second. At 20
> events/second you are producing 3Mb per event ... is that reasonable
> given the application data you are saving per event from an
> application perspective?
>
> If you still think that that is excessive, then I think submitting a
> HR is called for. Tech support can walk you through the tranlog
> analysis steps and help determine if what you are experiencing a
> problem that has already been fixed (the other customer with tranlog
> growth problems) or if you are experiencing "normal growth" --- we are
> aware of additional optimizations that can be made for tranlogs, but
> those optimizations are not yet available and there also the
> possibility of switching to another data structure, or you may even be
> persisting more data than you expect -- without analysis we can only
> guess.
>
> I'm not sure that you need to change the SigAbort handler that we
> talked about at ESUG. I am under the impression that you are not
> concerned about extent growth and if so, then  that SigAbort handler
> is doing it's job.
>
> Dale
>
>
> On 10/24/2016 11:12 AM, Dale Henrichs wrote:
>>
>>
>> On 10/24/2016 06:12 AM, Marten Feldtmann via Glass wrote:
>>>
>>> Concerning transactions: Well, all Topaz scripts (starting the Zinc
>>> server) are started in #manualBegin transactionMode. If not in that
>>> state the handling of SigAbort handlers is more difficult.
>>>
>> SigAbort handling has nothing to do with tranlog growth, so don't
>> mess with this ...
>>>
>>> Other than that: after an incoming request a BeginTransaction is
>>> done, the work is done and a commit or abort is then done ... then
>>> the gem is again out of an transaction - and then it should be no
>>> candidate for SigAbort handler (I'm not sure of that last statement).
>>>
>> Again ... Sigabort handling does not affect the amount of data that
>> is dumped to a tranlog
>>>
>>> Conflicts - yes initially lots of. After that we introduce retries
>>> on the server side and now the conflicts are not very often - though
>>> the external programmers should always consider this ( ... and its
>>> not done).
>>>
>>> Other conflicts can not be solved in this way and need a different
>>> API behaviour (normally implemented with RcQueue and worker topaz
>>> processes on the server side). The offered API then returns a TASK
>>> id and the programmer has to watch for its task ... long-term logic
>>> tasks can be programmed in a conflict-free way then on the server side.
>>>
>>> For logic conflicts we have a revision counter in the objects.
>>>
>>> (We use an API-HTTP-REST-RPC oriented way of programming the
>>> Gemstone/S database when interacting via C#, Python or Javascript
>>> with the database).
>>>
>>
>> Since you are not using Seaside we cannot blame continuations, so we
>> have to look for other sources of tranlog growth ...
>>
>> Do you have large sorted collections or other SequenceableCollection
>> collections where you insert data in the middle of the collection? If
>> you do we write tranlog entries for the shift of the data in these
>> types of collections ...
>>
>> If you use indexes it is possible for the  RcIndexDictionary
>> collision buckets to get large and they will behave like a
>> SequenceableCollection collection when it comes to inserts ...
>>
>> I think the best bet here is to first identify what's being dumped
>> into your tranlogs and then go from there ... we've recently had
>> another customer with tranlog growth issues and the engineer that
>> handled that is out of the office ... This would actually be a good
>> case for submitting an HR and then work directly on this issue with
>> tech support to help you identify the source of the tranlog growth
>> and work out strategies for reducing or otherwise addressing the
>> problem.
>>
>> Dale
>

_______________________________________________
Glass mailing list
[hidden email]
http://lists.gemtalksystems.com/mailman/listinfo/glass