Using SQLAlchemy in Python
SQLAlchemy is an absolutely outstanding, best-in-class ORM library.
However, the default setup is not perfect.
Idle in Transaction
The default behaviour in SQLAlchemy is to open a transaction for read queries.
However, read queries (inside a transaction), will block all database migrations.
Considering, that migrations block all subsequent query requests, this can take down an application.
Thus, even small transactions can become a serious problem.
In particular, consider when sessions are tied to web requests—a transaction could be open for several seconds.
Recommendation
When using the usual READ COMMITTED isolation level:
- use auto-commit for reads
- switch to explicit transactions for writes
Problem description and alternate solution: https://www.gorgias.com/blog/prevent-idle-in-transaction-engineering https://www.oddbird.net/2014/06/14/sqlalchemy-postgres-autocommit/
Weakly Referenced Objects
SQLAlchemy weakly references ORM instances. This means, when you convert them into a core type, they are dropped from the identity map.
The impact is that when it comes to updates, SQLAlchemy will have to inefficiently query the database for the object again.
Recommendation
Use SQLAlchemy's recipe for strong references.
Good to Know
- SQLAlchemy has an identity map that caches queried objects
- SQLAlchemy using mutation tracking and updates the precise fields that changed
Other tips
- Remember to index on ForeignKeys. Postgres does not do this automatically. MySQL always does this.
- Remember to set
onupdate
andondelete
cascades on ForeignKeys. - To prevent spurious joins: (this cannot be used with NULLable outer joins)
session.query(Model).options( sa.orm.joinedload("child").joinedload("grandchild"), sa.orm.raiseload("*"), )