user_guide:howto:polydb_tutorial

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
user_guide:howto:polydb_tutorial [2020/08/31 09:38] – [Inserting new Data] paffenholzuser_guide:howto:polydb_tutorial [2021/06/19 21:33] (current) – correction of a typo schroeter
Line 1: Line 1:
 ====== Introduction to polyDB ====== ====== Introduction to polyDB ======
  
-This tutorial explains how to use the ''polyDB'' from within polymake. If you want to access the data without using ''polymake'' please check [[user_guide:howto:polydb_api|here]].+This tutorial explains how to access the [[:polydb|polyDB database]] from within polymake using the extension ''polyDB''. It comes bundled with ''polymake'', so there is no need to install extra software, except for the MongoDB.pm perl package. (This tutorial is for polymake version 4 and later. The old version is [[poly_db_tutorial|here]], but this needs also an old version of the database). If you encounter any errors or problems concerning ''polyDB'' or the extension, please don't hesitate to [[https://forum.polymake.org/|ask in the forum]]. 
  
-The polyDB extension provides access to the [[:polydb|polyDB database]]. It comes bundled with ''polymake'', so there is no need to install extra software, except for the MongoDB.pm perl package. If you encounter any errors or problems concerning polyDB, please don't hesitate to [[https://forum.polymake.org/|ask in the forum]].  +The ''polymake'' extension is not necessary to use the data. You can access the data also 
- +
-However, the ''polymake'' extension is not necessary to use the data. You can access the data also +
   * via the [[https//db.polymake.org|web interface]]   * via the [[https//db.polymake.org|web interface]]
   * using the mongo shell directly or via any gui (see **below** for more details (to be written))   * using the mongo shell directly or via any gui (see **below** for more details (to be written))
-  * using the polyDB REST API+  * using the [[user_guide:howto:polydb_api|polyDB REST API]].  
 Software developers can also include access to polyDB using any of the many MongoDB interfaces and use the data directly in their programs. The few structural assumptions made in the database that you need to follow in your development are explained **below** (to be written). Software developers can also include access to polyDB using any of the many MongoDB interfaces and use the data directly in their programs. The few structural assumptions made in the database that you need to follow in your development are explained **below** (to be written).
  
 ===== Initializing Access ===== ===== Initializing Access =====
  
-You need a **database connection** before you can work with ''polyDB'' (and a working internet connection if you want to access a database that is not stored in tour local computer). For the main instance of ''polyDB'' you can just call+You need a **database connection** before you can work with ''polyDB'' (and a working internet connection if you want to access a database that is not stored in your local computer). For the main instance of ''polyDB'' you can just call
 <code> <code>
   $polydb = polyDB();   $polydb = polyDB();
Line 21: Line 20:
   $polydb = polyDB(username=><username>, passwd=><password>);   $polydb = polyDB(username=><username>, passwd=><password>);
 </code> </code>
-You can also store this in the custom variables <code>$PolyDB::default::db_user</code> and <code>$PolyDB::default::db_pwd</code>... Then you don't have to specify them in the connection method, they will be picked up automatically. +You can also store this in the custom variables <code>$PolyDB::default::db_user</code> and <code>$PolyDB::default::db_pwd</code>... Then you don't have to specify them in the connection method, they will be picked up automatically and are preserved over sessions
  
 ===== Learning which Data is Available ===== ===== Learning which Data is Available =====
Line 52: Line 51:
 ===== Reading Data ===== ===== Reading Data =====
  
-Before you can access a collection you need to create a connection to it with the method ''get_collection''So, if you want to acess the list of 0/1-polytopes up to combinatorial equivalence you would call+Before you can access a collection you need to establish **connection handle** with the method ''get_collection''For example, if you want to acess the list of 0/1-polytopes up to combinatorial equivalence you would call
 <code> <code>
 $collection=$polydb->get_collection("Polytopes.Combinatorial.01Polytopes"); $collection=$polydb->get_collection("Polytopes.Combinatorial.01Polytopes");
Line 61: Line 60:
   * ''find'': to obtain a cursor on objects satisfying the provided query   * ''find'': to obtain a cursor on objects satisfying the provided query
   * ''distinct'': to get an array of distinct values for a property among all objects satisfying the provided query   * ''distinct'': to get an array of distinct values for a property among all objects satisfying the provided query
-  * ''aggregate'': to appy complex aggregation pipelines on a collection+  * ''aggregate'': to apply complex aggregation pipelines on a collection
  
-The main argument of the first three functions is a MongoDb query hash. You can use the full MongoDB query syntax as decribed [[https://docs.mongodb.com/manual/tutorial/query-documents/|here]]. Note that ''polymake'' uses the perl interface, so the query should be given as a perl hash instead of a json document (it mostly suffices to use ''=>'' instead of '':''). For some perl examples see [[https://metacpan.org/pod/MongoDB::Tutorial#Retrieving-Documents|here]]. Basic queries for one or more parameter look like<code>{"N_VERTICES"=>10}+The main argument of the first three functions is a MongoDD query hash. You can use the full MongoDB query syntax as decribed [[https://docs.mongodb.com/manual/tutorial/query-documents/|here]]. Note that ''polymake'' uses the perl interface interface of MongoDB, so the query should be given as a perl hash instead of a json document (it mostly suffices to use ''=>'' instead of '':''). For some perl examples see [[https://metacpan.org/pod/MongoDB::Tutorial#Retrieving-Documents|here]]. Basic queries for one or more parameter look like<code>{"N_VERTICES"=>10}
 {"DIM"=>5, "N_FACETS"=>7} {"DIM"=>5, "N_FACETS"=>7}
 </code> </code>
-Bounds or Ranges can be defined with the operators ''&gt'', ''&gte'', ''&lt'' and ''&lte''. For example<code>{"N_VERTICES"=> { "&gte" => 5, "&lte" => 10 } }</code> returns documents where the number of vertices is between five and ten (including the boundaries). More operators can be found [[https://docs.mongodb.com/manual/reference/operator/query/#query-selectors|here]]. You can also query for elements in arrays either somewhere in the array or at a specific position.+Bounds or ranges can be defined with the operators ''&gt'', ''&gte'', ''&lt'' and ''&lte''. For example<code>{"N_VERTICES"=> { "&gte" => 5, "&lte" => 10 } }</code> returns documents where the number of vertices is between five and ten (including the boundaries). More operators can be found [[https://docs.mongodb.com/manual/reference/operator/query/#query-selectors|here]]. You can also query for elements in arrays either somewhere in the array or at a specific position.
  
 The last function allows to pass an aggregation pipeline as described [[https://docs.mongodb.com/manual/aggregation/|here]] (note again that the pipeline needs to be passed as a perl hash instead of a json document).  The last function allows to pass an aggregation pipeline as described [[https://docs.mongodb.com/manual/aggregation/|here]] (note again that the pipeline needs to be passed as a perl hash instead of a json document). 
Line 85: Line 84:
   * ''limit=>$n'': returns at most ''$n'' documents   * ''limit=>$n'': returns at most ''$n'' documents
  
-==== Access Credentials ====+You can reset the cursor with<code>$cur->reset</code>if you want to iterate over the results again. 
  
-There are two pairs of custom variables for access credentials: 
-  * ''$PolyDB::default::db_{user,pwd}'': For a user that has read access (usually set to ''polymake''/''database'' for all public collections). Set this to your private user if you have been granted access to private collections 
-  * ''$PolyDB::default::db_collection_admin_{user,pwd}'': For credentials with write access to collections. Note that the first pair is not checked for write access even if the user given there has write access. You must set this pair for write access.  
 ===== Inserting new Data ===== ===== Inserting new Data =====
  
Line 113: Line 109:
     "uri" : "https://polymake.org"     "uri" : "https://polymake.org"
 }</code> }</code>
-  * You need to provide a full json schema describing your data. If you have a polymake object with the data you want, then the function ''create_restrictive_schema'' can help you with this and provide an initial template. In ''polyDB'' the json schema is stored as the ''schema'' entry of a document also specifying the section and collection in the entries ''section'' and ''collection''Both JSON schemas and MongoDB use ''$'' as special character to specify functions. This leads to a clash when you try to store a json schema in MongoDB. Hence, in the schema document we replace ''$''$ with ''__'' for storing and restore this when reading the schema.  +  * You need to provide a full json schema describing your data. If you have a polymake object with the data you want, then the function ''create_restrictive_schema'' can help you with this and provide an initial template. Here is part of the schema for 0/1-Polytopes.<code>
 +   "type": "object", 
 +   "$schema": "http://json-schema.org/draft-07/schema#", 
 +   "properties":
 +      "SELF_DUAL":
 +         "$ref": "#/definitions/common-Bool" 
 +      }, 
 +      "VERTICES":
 +         "$ref": "#/definitions/common-Matrix-Rational-NonSymmetric" 
 +      }, 
 +      "_ns":
 +         "additionalProperties": false, 
 +         "properties":
 +            "polymake":
 +               "type": "array", 
 +               "additionalItems": false, 
 +               "items":
 +                  { 
 +                     "const": "https://polymake.org" 
 +                  }, 
 +                  { 
 +                     "const": "3.5" 
 +                  } 
 +               ] 
 +            } 
 +         }, 
 +         "type": "object" 
 +      } 
 +   }, 
 +   "additionalProperties": false, 
 +   "required":
 +      "_ns", 
 +      "SELF_DUAL", 
 +      "VERTICES" 
 +   ], 
 +   "definitions":
 +      "common-Rational":
 +         "pattern": "^-?(\\d+(/\\d+)?|inf)$", 
 +         "type": "string" 
 +      }, 
 +      "common-Matrix-Rational-NonSymmetric":
 +         "type": "array", 
 +         "items":
 +            "oneOf":
 +               { 
 +                  "$ref": "#/definitions/common-Vector-Rational" 
 +               }, 
 +               { 
 +                  "type": "object", 
 +                  "properties":
 +                     "cols":
 +                        "type": "integer", 
 +                        "minimum":
 +                     } 
 +                  }, 
 +                  "required":
 +                     "cols" 
 +                  ], 
 +                  "additionalProperties": false 
 +               } 
 +            ] 
 +         } 
 +      }, 
 +      "common-Bool":
 +         "type": "boolean" 
 +      }, 
 +      "common-Vector-Rational":
 +         "items":
 +            "$ref": "#/definitions/common-Rational" 
 +         }, 
 +         "type": "array" 
 +      } 
 +   } 
 +}</code>This schema needs one special property ''_polyDB'', which specifies some information on the document. Among the properties you should have<code>"_polyDB":
 +         "required":
 +            "collection", 
 +            "creation_date", 
 +            "section", 
 +            "uri", 
 +            "version" 
 +         ], 
 +         "type": "object", 
 +         "properties":
 +            "uri":
 +               "type": "string" 
 +            }, 
 +            "collection": { 
 +               "type": "string" 
 +            }, 
 +            "version":
 +               "type": "string", 
 +               "pattern": "^[0-9]{1,2}.[0-9]{1,2}$
 +            }, 
 +            "creation_date":
 +               "pattern": "^[1-9][0-9]{3}-[0-9]{2}-[0-9]{2}$", 
 +               "type": "string" 
 +            }, 
 +            "section":
 +               "type": "string" 
 +            } 
 +         } 
 +      }</code> and should be required property with an entry also in the properties listed in ''_attr'' as <code> 
 +      "_attrs":
 +         "additionalProperties": false, 
 +         "properties":
 +            "_polyDB":
 +               "properties":
 +                  "attachment":
 +                     "const": true 
 +                  } 
 +               } 
 +            } 
 +         }, 
 +         "type": "object" 
 +      }</code> 
   * If you want you collection to be included in the ''db_info'' command you need a json document describing you collection in the form<code>{   * If you want you collection to be included in the ''db_info'' command you need a json document describing you collection in the form<code>{
    "collection" : "TOM",    "collection" : "TOM",
Line 130: Line 241:
    ]    ]
 }</code> }</code>
-  * If you want to place this also in a new section, then also this (and all new subsections created) need a description document. +  * If you want to place this also in a new section, then also this (and all new subsections created) need a description document. However, a description for a section can only be edited by an administrator, so just send us the description and we will add it
  
-Meta information, schema and documentation are stored with the functions+Meta information, schema and documentation are stored with the methods
 <code> <code>
-db_set_collection_meta_information($meta);  +$collection->set_info($meta);  
-db_set_collection_schema($schema);  +$collection->set_schema($schema);  
-db_write_collection_metadata(file=><file>);+$collection->set_collection_doc($doc, replace=>true/false, update=>true/false);
 </code> </code>
-where in the first two functions the argument is either a perl hash or the name of a file containing a json document.+where the first argument is the data as a perl hash. 
  
-Insertion is done with the function ''db_insert''. This function either takes a file, a single ''polymake'' big object or an array of such as first argument and writes this data into the collection specified by the options ''section'' and ''collection''. As for queries you can set these via custom variables and then don't need to specify them in ''db_insert''Currently you need to specify ''use_schema =1'' in the command to use the meta information and the json schema you provided. Further options are +Insertion of data is done with the method ''insert''. This function either takes a file, a single ''polymake'' big object or an array of such as first argument and writes this data into the collection specified by the options ''section'' and ''collection'' (these can also be specified with the same custom variables as for queries)This has some more options, see <code>help "insert";</code>in the polymake shell.
-  * ''type_information'': to specify a different json schema as a perl hash +
-  * ''replace'': to replace an existing document +
-  * ''noinsert'': For a dry run of the command.+
  
  
Line 150: Line 258:
 === Starting a new collection === === Starting a new collection ===
  
-A new collection is started with the command<code>db_admin_initiate_collection(section=><section>, collection=><collection>);</code> where you can omit the two options if you have set the section and collection name with the two custom variables ''$PolyDB::default::db_section_name'' and ''$PolyDB::default::db_collection_name'' before. If the collection should not be public, then also pas the option ''public=>false''. For a public collection the read access role of the new collection is added to the default role ''polymakeUser'' which is granted to every user of ''polyDB''. One can add this later if one wants to build up and test the collection befor making it publicly available. +A new collection is started with the command<code>$polydb->initiate_collection(section=><section>, collection=><collection>);</code>If the collection should not be public, then also pas the option ''public=>false''. For a public collection the read access role of the new collection is added to the default role ''polymakeUser'' which is granted to every user of ''polyDB''. One can add this later if one wants to build up and test the collection befor making it publicly available. 
  
-If this creates new intermediate subsections you should set the section documentation with <code>db_write_section_metadata(file=><file>);</code> so that the new collection appears in the list printed by ''db_info'' for all users with sufficient permissions. +If this creates new intermediate subsections you should set the section documentation with <code>$polydb->set_section_doc($doc, section=>...);</code> so that the new collection appears in the list printed by ''db_info'' for all users with sufficient permissions. 
  
 Note that the first command essentially only creates two new roles in MongoDB, one for read access to the collection (and all sections up to the root) and one for write access to the collection (and only to the collection, not to the sections). The actual collections are only created once the first document is written into the collection. This implies that collections will not be listed with ''db_info'' if any of the intermediate sections has no documentation, as then the collection where this is stored is not created.  Note that the first command essentially only creates two new roles in MongoDB, one for read access to the collection (and all sections up to the root) and one for write access to the collection (and only to the collection, not to the sections). The actual collections are only created once the first document is written into the collection. This implies that collections will not be listed with ''db_info'' if any of the intermediate sections has no documentation, as then the collection where this is stored is not created. 
  
-Any user that has the write access rule (which you can assign with ''db_admin_add_user_to_collection'') can insert, delete and modify documents in the collection, the meta information, the schema and to documentation of the collection. +You can add users for read acces with the method ''add_user_to_collection''. Note that this is only useful if the collection is not public. Any user that has the write access rule (which you can assign with ''$polydb->add_user_to_collection(user=>..., collection=>..., admin=>true'') can insert, delete and modify documents in the collection, the meta information, the schema and to documentation of the collection. Note that in this method the collection must be given fully qualified, e.g. as ''Polytopes.Combinatorial.01Polytopes''.
  
-=== Access Credentials === 
  
-There are three pairs of custom variables for access credentials: 
-  * ''$PolyDB::default::db_{user,pwd}'': For a user that has read access (usually set to ''polymake''/''database'' for all public collections) 
-  * ''$PolyDB::default::db_collection_admin_{user,pwd}'': For credentials with write access to collections 
-  * ''$PolyDB::default::db_admin_{user,pwd}'': For MongoDB admin credentials. This is used instead of the collection admin credentials if those are not set.  
  • user_guide/howto/polydb_tutorial.1598866694.txt.gz
  • Last modified: 2020/08/31 09:38
  • by paffenholz