Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduce db_pools contrib crate, for asynchronous database connection pooling #1696

Closed
wants to merge 7 commits into from

Conversation

jebrosen
Copy link
Collaborator

@jebrosen jebrosen commented Jun 14, 2021

Resolves #1117.

This implementation is similar to sync_db_pools and derives most of its documentation from that crate, but there are a few notable differences:

  • Database types are declared via a #[derive(Database)] in combination with the #[database(name = "")] attribute, which makes it more straightforward to see what methods are available on the decorated type.
  • Databases are declared in terms of the pool type, not the connection type. This helps avoid requiring adapters to go through a particular pool implementation such as r2d2, and allows us to use built-in connection pooling support that is provided by some database drivers.
  • The decorated database type is placed directly in managed state by its fairing, so that .pool() and .get() are easily accessible after attaching the fairing and from within routes. This is especially useful for integrations that provide additional methods on the pool type which would otherwise be inaccessible.
  • Direct access to the underlying connection pool is available (Expose pool directly from generated database pools #1187), via the .pool() method.
  • All relevant APIs are async, and no additional wrapper (a la run()) is necessary to bridge between async and blocking code.

TODO:

  • Solidify the configuration API. In sync_db_pools, each implementation of Poolable deserializes its config from a Rocket instance. In this implementation, the configuration is restricted any DeserializeOwned type and the deserialization is done by the database fairing - this makes it easier to correctly implement Pool since it is no longer responsible for looking up its configuration, but it is impossible to query the worker count when choosing a pool size. I would appreciate any ideas on how to best reconcile this.
  • pool_size - Different database pools accept different combinations of min or max pool size, and currently most Pool implementations use the default.
  • Configurable timeout for connection retrieval.
  • Tests.
  • Documentation additions and corrections, especially the guide. Most of the documentation here has simply been copied from sync_db_pools and adjusted for the different trait names and provided integrations.

Sorry, something went wrong.

@jebrosen jebrosen requested a review from SergioBenitez June 14, 2021 01:47
@SergioBenitez
Copy link
Member

I am so stoked to see this! This is huge for 0.5. Will prioritize reviewing this ASAP. Thank you!

Copy link
Member

@SergioBenitez SergioBenitez left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've left some comments. The biggest issues I see are:

  • Config needs to be redone. I've laid out a structure in the comments.
  • Libraries aren't being used to their fullest extent. Use devise and figment.
  • Lots of items are fully qualified, increasing the verbosity of code unnecessarily.
  • The effective need to declare a type alias is unfortunate. Maybe an associated type helps.

/// [`Pool::initialize()`]: crate::Pool::initialize
#[derive(Serialize, Deserialize, Debug, Clone, PartialEq)]
#[serde(crate = "rocket::serde")]
pub struct Config {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We shouldn't need this structure at all. Each implementation should define their own config structure.

See my other comments in config.

Comment on lines +53 to +58
/// The error type returned by `initialize`.
type InitError: std::error::Error;

/// The error type returned by `get`.
type GetError: std::error::Error;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This feels wrong to me. Why isn't there just one Error?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

InitError is rocket_db_pools::Error, which represents either bad configuration or the database-specific error. GetError could be the same error type, but it would always be the non-configuration variant.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For some database drivers (e.g. deadpool-postgres), the error type returned by initialization vs by get() are actually different. In that case one error type is a superset of the other, but I wouldn't want to assume that about future drivers.

Copy link
Member

@SergioBenitez SergioBenitez Jun 27, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, but that doesn't mean the pool integration can't wrap those into one type.

I think there's a lot of effort right now spent on trying to creates these reusable structures like Config and Error (the least common denominators) when, in reality, each database driver (at least those from each library) will be fairly distinct. I would advocate for eschewing both of these types entirely and instead creating an Error and Config for each database variant/library as needed. With Config specifically, this is pretty important.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

when, in reality, each database driver (at least those from each library) will be fairly distinct

So, I tried using a separate error type for each implementation and ran into every implementation of Pool::initialize needing to return a figment::Error. I feel again like I'm completely misunderstanding something, because the single Error type with 3 variants now seems like a lot less effort than writing and documenting the same 3 variants of a separate error enum for each database.

Err(e) => {
error!("Failed to read SQLx config: {}", e);
return Err(rocket);
match rocket.state::<Db>() {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This feels like an abstraction has just leaked. I think we want something like https://api.rocket.rs/v0.5-rc/rocket/struct.State.html#method.get. Maybe Db::pool(&rocket); and Db::conn(&rocket).await;.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Leaking the abstraction in this particular way (putting the derived database type directly in managed state) was intentional, because I thought it was much simpler to understand. Db::pool and Db::conn already take &self, so this would logically be adding Db::pool_from_rocket(&rocket) and Db::conn_from_rocket(&rocket).await. Happy to make that change, though.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The problem is that this presents two different interfaces: an abstracted Db::fairing() and now a Db in managed state. If the user had instead been asked to do .manage(Db::init()?) or something to that effect, this would be fine. But otherwise, .attach(Db::fairing()) says nothing about something going into managed state.

Copy link
Member

@SergioBenitez SergioBenitez left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Something feels quite wrong here with the way Database is defined:

  • It's very weird that there are two error types in Database. There really needs to be one.
  • We need to specify configs per database. The existence of a common Config type in 0.4 was a mistake rooted in the inability to generalize configuration. With Figment, this mistake doesn't need to be made again. Each database type (or library, or wherever the distinction lies) needs to define its own configuration structure. This immediately resolves all weirdness around configuration and allows the user to configure all aspects of each database.
  • The Fairing struct is very odd. It effectively demands that the database type is Fn(Self::Pool) -> Self without having that fact be codified or enforced anywhere. Even so, that feels like a strange restriction. It seems we really just want some way to go from a database pool type to a structure. Perhaps the bound we want on Database is From<Self::Pool> (which would imply generating such an implementation), or maybe we want a fn new(pool: Db::Pool) -> Self; on Database. I'm not sure exactly which of these is the right approach, but the current Fairing situation feels like the wrong one.
  • Something needs to be done about the effective need for a type alias. Whether that's generating an inherent associated type or something similar, I don't know. But it needs to be trivial and concise to get a connection without adding any code.

fn pool(&self) -> &Self::Pool;

/// get().await returns a connection from the pool (or an error)
fn get(&self) -> BoxFuture<'_, Result<Connection<Self>, <Self::Pool as Pool>::GetError>> {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wanted to avoid boxing every time we retrieved a connection. If the pool type implemented an async fn get() method inherently, and codegen could call that method directly, then we would avoid the box. There's no reason to Box manually without using async_trait. This thinking relied on the Db type implementing FromRequest or coherence allowing you to generate a FromRequest for Connection<Db>.

Comment on lines +19 to +20
/// Returns a fairing that attaches this connection pool to the server.
fn fairing() -> Fairing<Self>;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This Fairing type feels pretty weird to me. It seems to me that a Database is itself a Fairing. Or something like that. There's something not quite right about the setup right now with respect to this fairing, the Fairing type, and the Connection type.

/// Create a new database fairing with the given constructor. This
/// constructor will be called to create an instance of `D` after the pool
/// is initialized and before it is placed into managed state.
pub fn new(fairing_name: &'static str, ctor: fn(D::Pool) -> D, ) -> Self {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is really weird.

}
};

Ok(rocket.manage((self.1)(pool)))
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will panic if one is already being managed. We should probably check and emit a nicer error.

Comment on lines +53 to +58
/// The error type returned by `initialize`.
type InitError: std::error::Error;

/// The error type returned by `get`.
type GetError: std::error::Error;
Copy link
Member

@SergioBenitez SergioBenitez Jun 27, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, but that doesn't mean the pool integration can't wrap those into one type.

I think there's a lot of effort right now spent on trying to creates these reusable structures like Config and Error (the least common denominators) when, in reality, each database driver (at least those from each library) will be fairly distinct. I would advocate for eschewing both of these types entirely and instead creating an Error and Config for each database variant/library as needed. With Config specifically, this is pretty important.

Comment on lines +72 to +73
async fn initialize(db_name: &str, rocket: &Rocket<Build>)
-> Result<Self, Error<Self::InitError>>;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This shouldn't be formatted this way. At worst:

Suggested change
async fn initialize(db_name: &str, rocket: &Rocket<Build>)
-> Result<Self, Error<Self::InitError>>;
async fn initialize(
db_name: &str,
rocket: &Rocket<Build>
) -> Result<Self, Error<Self::InitError>>;

But I really think this needs to be Self::Error.

#[database(name = "sqlx")]
struct Db(sqlx::SqlitePool);

type Connection = rocket_db_pools::Connection<Db>;
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not of the trait, but of the type directly.

Err(e) => {
error!("Failed to read SQLx config: {}", e);
return Err(rocket);
match rocket.state::<Db>() {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The problem is that this presents two different interfaces: an abstracted Db::fairing() and now a Db in managed state. If the user had instead been asked to do .manage(Db::init()?) or something to that effect, this would be fine. But otherwise, .attach(Db::fairing()) says nothing about something going into managed state.

@paulzhang5511
Copy link

How to test a query statement with rocket_db_pools?

@SergioBenitez
Copy link
Member

Merged in f7f068b with design changes in 5b1a04d.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Tracking Issue for Asynchronous Database Pool Support
3 participants