Update README index

2023-03-25 17:41:32 +00:00
parent 11bf00e181
commit 8e691b5681
5 changed files with 336 additions and 261 deletions
--- a/topics/databases/README.md
+++ b/topics/databases/README.md
@@ -0,0 +1,191 @@
+# Databases
+
+- [Databases](#databases)
+  - [Exercises](#exercises)
+  - [Questions](#questions)
+    - [SQL](#sql)
+    - [Time Series](#time-series)
+
+## Exercises
+
+|Name|Topic|Objective & Instructions|Solution|Comments|
+|--------|--------|------|----|----|
+| Message Board Tables  | Relational DB Tables | [Exercise](topics/databases/table_for_message_board_system.md) | [Solution](topics/databases/solutions/table_for_message_board_system.md)
+
+## Questions
+
+
+<details>
+<summary>What type of databases are you familiar with?</summary><br><b>
+
+Relational (SQL)
+NoSQL
+Time serties
+</b></details>
+
+### SQL
+
+<details>
+<summary>What is a relational database?</summary><br><b>
+
+  * Data Storage: system to store data in tables
+  * SQL: programming language to manage relational databases
+  * Data Definition Language: a standard syntax to create, alter and delete tables
+
+</b></details>
+
+<details>
+<summary>What does it mean when a database is ACID compliant?</summary><br>
+
+ACID stands for Atomicity, Consistency, Isolation, Durability. In order to be ACID compliant, the database must meet each of the four criteria
+
+**Atomicity** - When a change occurs to the database, it should either succeed or fail as a whole.
+
+For example, if you were to update a table, the update should completely execute. If it only partially executes, the
+update is considered failed as a whole, and will not go through - the DB will revert back to it's original
+state before the update occurred. It should also be mentioned that Atomicity ensures that each
+transaction is completed as it's own stand alone "unit" - if any part fails, the whole statement fails.
+
+**Consistency** - any change made to the database should bring it from one valid state into the next.
+
+For example, if you make a change to the DB, it shouldn't corrupt it. Consistency is upheld by checks and constraints that
+are pre-defined in the DB. For example, if you tried to change a value from a string to an int when the column
+should be of datatype string, a consistent DB would not allow this transaction to go through, and the action would
+not be executed
+
+**Isolation** - this ensures that a database will never be seen "mid-update" - as multiple transactions are running at
+the same time, it should still leave the DB in the same state as if the transactions were being run sequentially.
+
+For example, let's say that 20 other people were making changes to the database at the same time. At the
+time you executed your query, 15 of the 20 changes had gone through, but 5 were still in progress. You should
+only see the 15 changes that had completed - you wouldn't see the database mid-update as the change goes through.
+
+**Durability** - Once a change is committed, it will remain committed regardless of what happens
+(power failure, system crash, etc.). This means that all completed transactions
+must be recorded in non-volatile memory.
+
+Note that SQL is by nature ACID compliant. Certain NoSQL DB's can be ACID compliant depending on
+how they operate, but as a general rule of thumb, NoSQL DB's are not considered ACID compliant
+</details>
+
+<details>
+<summary>What is sharding?</summary><br><b>
+
+Sharding is a horizontal partitioning.
+
+Are you able to explain what is it good for?
+</b></details>
+
+<details>
+<summary>You find out your database became a bottleneck and users experience issues accessing data. How can you deal with such situation?</summary><br><b>
+
+Not much information provided as to why it became a bottleneck and what is current architecture, so one general approach could be<br>
+to reduce the load on your database by moving frequently-accessed data to in-memory structure.
+</b></details>
+
+<details>
+<summary>What is a connection pool?</summary><br><b>
+
+Connection Pool is a cache of database connections and the reason it's used is to avoid an overhead of establishing a connection for every query done to a database.
+</b></details>
+
+<details>
+<summary>What is a connection leak?</summary><br><b>
+
+A connection leak is a situation where database connection isn't closed after being created and is no longer needed.
+</b></details>
+
+<details>
+<summary>What is Table Lock?</summary><br><b>
+</b></details>
+
+<details>
+<summary>Your database performs slowly than usual. More specifically, your queries are taking a lot of time. What would you do?</summary><br><b>
+
+* Query for running queries and cancel the irrelevant queries
+* Check for connection leaks (query for running connections and include their IP)
+* Check for table locks and kill irrelevant locking sessions
+</b></details>
+
+<details>
+<summary>What is a Data Warehouse?</summary><br><b>
+
+"A data warehouse is a subject-oriented, integrated, time-variant and non-volatile collection of data in support of organisation's decision-making process"
+</b></details>
+
+<details>
+<summary>Explain what is a time-series database</summary><br><b>
+</b></details>
+
+<details>
+<summary>What is OLTP (Online transaction processing)?</summary><br><b>
+</b></details>
+
+<details>
+<summary>What is OLAP (Online Analytical Processing)?</summary><br><b>
+</b></details>
+
+<details>
+<summary>What is an index in a database?</summary><br><b>
+
+A database index is a data structure that improves the speed of operations in a table. Indexes can be created using one or more columns, providing the basis for both rapid random lookups and efficient ordering of access to records.
+</b></details>
+
+<details>
+<summary>What data types are there in relational databases?</summary><br><b>
+</b></details>
+
+<details>
+<summary>Explain Normalization</summary><br><b>
+
+Data that is used multiple times in a database should be stored once and referenced with a foreign key.<br>
+This has the clear benefit of ease of maintenance where you need to change a value only in a single place to change it everywhere.
+</b></details>
+
+<details>
+<summary>Explain Primary Key and Foreign Key</summary><br><b>
+
+Primary Key: each row in every table should a unique identifier that represents the row.<br>
+Foreign Key: a reference to another table's primary key. This allows you to join table together to retrieve all the information you need without duplicating data.
+</b></details>
+
+<details>
+<summary>What types of data tables have you used?</summary><br><b>
+
+  * Primary data table: main data you care about
+  * Details table: includes a foreign key and has one to many relationship
+  * Lookup values table: can be one table per lookup or a table containing all the lookups and has one to many relationship
+  * Multi reference table
+</b></details>
+
+<details>
+<summary>What is ORM? What benefits it provides in regards to relational databases usage?</summary><br><b>
+
+[Wikipedia](https://en.wikipedia.org/wiki/Object%E2%80%93relational_mapping): "is a programming technique for converting data between incompatible type systems using object-oriented programming languages"
+
+In regards to the relational databases:
+
+  * Database as code
+  * Database abstraction
+  * Encapsulates SQL complexity
+  * Enables code review process
+  * Enables usage as a native OOP structure
+</b></details>
+
+<details>
+<summary>What is DDL?</summary><br><b>
+
+[Wikipedia](https://en.wikipedia.org/wiki/Data_definition_language): "In the context of SQL, data definition or data description language (DDL) is a syntax for creating and modifying database objects such as tables, indices, and users."
+</b></details>
+
+### Time Series
+
+<details>
+<summary>What is Time Series database?</summary><br><b>
+
+A database designed specifically for time series based data.
+
+It comes with multiple optimizations:
+
+<TODO>: complete this :)
+</b></details>
--- a/topics/observability/README.md
+++ b/topics/observability/README.md
@@ -0,0 +1,88 @@
+# Observability
+
+- [Observability](#observability)
+  - [Monitoring](#monitoring)
+  - [Data](#data)
+  - [Application Performance Management](#application-performance-management)
+
+<details>
+<summary>What's Observability?</summary><br><b>
+</b></details>
+
+## Monitoring
+
+<details>
+<summary>What's monitoring? How is it related to Observability?</summary><br><b>
+
+Google: "Monitoring is one of the primary means by which service owners keep track of a system’s health and availability".
+</b></details>
+
+<details>
+<summary>What types of monitoring outputs are you familiar with and/or used in the past?</summary><br><b>
+
+Alerts<br>
+Tickets<br>
+Logging<br>
+</b></details>
+
+## Data
+
+<details>
+<summary>Can you mention what type of things are often montiored in the IT industry?</summary><br><b>
+
+- Hardware (CPU, RAM, ...)
+- Infrastructure (Disk capacity, Network latency, ...)
+- App (Status code, Errors in logs, ...)
+</b></details>
+
+<details>
+<summary>Explain "Time Series" data</summary><br><b>
+
+Time series data is sequenced data, measuring certain parameter in ordered (by time) way.
+
+An example would be CPU utilization every hour:
+
+```
+08:00   17
+09:00   22
+10:00   91
+```
+</b></details>
+
+<details>
+<summary>Explain data aggregation</summary><br><b>
+
+In monitoring, aggregating data is basically combining collection of values. It can be done in different ways like taking the average of multiple values, the sum of them, the count of many times they appear in the collection and other ways that mainly depend on the type of the collection (e.g. time-series would be one type).
+
+</b></details>
+
+
+## Application Performance Management
+
+<details>
+<summary>What is Application Performance Management?</summary><br><b>
+
+- IT metrics translated into business insights
+- Practices for monitoring applications insights so we can improve performances, reduce issues and improve overall user experience
+
+</b></details>
+
+<details>
+<summary>Name three aspects of a project you can monitor with APM (e.g. backend)</summary><br><b>
+
+- Frontend
+- Backend
+- Infra
+- ...
+
+</b></details>
+
+<details>
+<summary>What can be collected/monitored to perform APM monitoring?</summary><br><b>
+
+- Metrics
+- Logs
+- Events
+- Traces
+
+</b></details>