devops-exercises

⚠️ You can use these for preparing for an interview but most of the questions and exercises don't represent an actual interview. Please read FAQ page for more details

👥 Join our DevOps community where we have discussions and share resources on DevOps

📝 You can add more questions and exercises by submitting pull requests :) Read about contribution guidelines here

DevOps	CI/CD	Git	Ansible	Network	Linux
Programming	Python	Go	Shell Scripting	Kubernetes	Prometheus
Cloud	AWS	Azure	Google Cloud Platform	OpenStack	Security
Operating System	Monitoring	Elastic	Virtualization	DNS	Misc
Testing	Databases	Regex	System Design	Hardware	Big Data
Certificates	Containers	SQL	OpenShift	Storage	HR
Terraform	Mongo	Puppet	Distributed	Questions you can ask	Perl

Network

What do you need in order to communicate?

A common language (for the two ends to understand)

A way to address who do you want to communicate with

A Connection (so the content of of the communication can reach the recipients)

What is TCP/IP?

A set of protocols that define how two or more devices can communicate with each other. To learn more about TCP/IP, read here

What is Ethernet?

Ethernet simply refers to the most common type of Local Area Network (LAN) used today. A LAN—in contrast to a WAN (Wide Area Network), which spans a larger geographical area—is a connected network of computers in a small area, like your office, college campus, or even home.

What is a MAC address? What is it used for?

A MAC address is a unique identification number or code used to identify individual devices on the network.

Packets that are sent on the ethernet are always coming from a MAC address and sent to a MAC address. If a network adapter is receiving a packet, it is comparing the packet’s destination MAC address to the adapter’s own MAC address.

When is this MAC address used?: ff:ff:ff:ff:ff:ff

When a device sends a packet to the broadcast MAC address (FF:FF:FF:FF:FF:FF), it is delivered to all stations on the local network. Ethernet broadcasts are used to resolve IP addresses to MAC addresses (by ARP) at the datalink layer .

What is an IP address?

An Internet Protocol address (IP address) is a numerical label assigned to each device connected to a computer network that uses the Internet Protocol for communication.An IP address serves two main functions: host or network interface identification and location addressing.

Explain subnet mask and given an example

A Subnet mask is a 32-bit number that masks an IP address, and divides the IP address into network address and host address. Subnet Mask is made by setting network bits to all "1"s and setting host bits to all "0"s. Within a given network, two host addresses are reserved for special purpose, and cannot be assigned to hosts. The "0" address is assigned a network address and "255" is assigned to a broadcast address, and they cannot be assigned to hosts.

For Example

| Address Class | No of Network Bits | No of Host Bits | Subnet mask | CIDR notation | | ------------- | ------------------ | --------------- | --------------- | ------------- | | A | 8 | 24 | 255.0.0.0 | /8 | | A | 9 | 23 | 255.128.0.0 | /9 | | A | 12 | 20 | 255.240.0.0 | /12 | | A | 14 | 18 | 255.252.0.0 | /14 | | B | 16 | 16 | 255.255.0.0 | /16 | | B | 17 | 15 | 255.255.128.0 | /17 | | B | 20 | 12 | 255.255.240.0 | /20 | | B | 22 | 10 | 255.255.252.0 | /22 | | C | 24 | 8 | 255.255.255.0 | /24 | | C | 25 | 7 | 255.255.255.128 | /25 | | C | 28 | 4 | 255.255.255.240 | /28 | | C | 30 | 2 | 255.255.255.252 | /30 |

What is a private IP address? In which scenarios/system designs, one should use it?

What is a public IP address? In which scenarios/system designs, one should use it?

Explain the OSI model. What layers there are? What each layer is responsible for?

Application: user end (HTTP is here)

Presentation: establishes context between application-layer entities (Encryption is here)

Session: establishes, manages and terminates the connections

Transport: transfers variable-length data sequences from a source to a destination host (TCP & UDP are here)

Network: transfers datagrams from one network to another (IP is here)

Data link: provides a link between two directly connected nodes (MAC is here)

Physical: the electrical and physical spec the data connection (Bits are here)

You can read more about the OSI model in penguintutor.com

For each of the following determine to which OSI layer it belongs:

Error correction
Packets routing
Cables and electrical signals
MAC address
IP address
Terminate connections
3 way handshake

Error correction

Packets routing - Network

Cables and electrical signals - Physical

MAC address - Data link

IP address - Network

Terminate connections - Session

3 way handshake - Transport

What delivery schemes are you familiar with?

Unitcast: One to one communication where there is one sender and one receiver.

Broadcast: Sending a message to everyone in the network. The address ff:ff:ff:ff:ff:ff is used for broadcasting. Two common protocols which use broadcast are ARP and DHCP.

Multicast: Sending a message to a group of subscribers. It can be one-to-many or many-to-many.

What is CSMA/CD? Is it used in modern ethernet networks?

CSMA/CD stands for Carrier Sense Multiple Access / Collision Detection. Its primarily focus it to manage access to shared medium/bus where only one host can transmit at a given point of time.

CSMA/CD algorithm:

Before sending a frame, it checks whether another host already transmitting a frame.

If no one transmitting, it starts transmitting the frame.

If two hosts transmitted at the same time, we have a collision.

Both hosts stop sending the frame and they send to everyone a 'jam signal' notifying everyone that a collision occurred

They are waiting for a random time before sending again

Once each host waited for a random time, they try to send the frame again and so the

Describe the following network devices and the difference between them:

router
switch
hub

How does a router works?

A router is a physical or virtual appliance that passes information between two or more packet-switched computer networks. A router inspects a given data packet's destination Internet Protocol address (IP address), calculates the best way for it to reach its destination and then forwards it accordingly.

What is NAT?

Network Address Translation (NAT) is a process in which one or more local IP address is translated into one or more Global IP address and vice versa in order to provide Internet access to the local hosts.

What is a proxy? How does it works? What do we need it for?

A proxy server acts as a gateway between you and the internet. It’s an intermediary server separating end users from the websites they browse.

If you’re using a proxy server, internet traffic flows through the proxy server on its way to the address you requested. The request then comes back through that same proxy server (there are exceptions to this rule), and then the proxy server forwards the data received from the website to you.

roxy servers provide varying levels of functionality, security, and privacy depending on your use case, needs, or company policy.

What is TCP? How does it works? What is the 3 way handshake?

TCP 3-way handshake or three-way handshake is a process which is used in a TCP/IP network to make a connection between server and client.

A three-way handshake is primarily used to create a TCP socket connection. It works when:

A client node sends a SYN data packet over an IP network to a server on the same or an external network. The objective of this packet is to ask/infer if the server is open for new connections.

The target server must have open ports that can accept and initiate new connections. When the server receives the SYN packet from the client node, it responds and returns a confirmation receipt – the ACK packet or SYN/ACK packet.

The client node receives the SYN/ACK from the server and responds with an ACK packet.

What is round-trip delay or round-trip time?

From wikipedia: "the length of time it takes for a signal to be sent plus the length of time it takes for an acknowledgement of that signal to be received"

Bonus question: what is the RTT of LAN?

How does SSL handshake work?

What is the difference between TCP and UDP?

TCP establishes a connection between the client and the server to guarantee the order of the packages, on the other hand, UDP does not establish a connection between client and server and doesn't handle package order. This makes UDP more lightweight than TCP and a perfect candidate for services like streaming.

Penguintutor.com provides a good explanation.

What TCP/IP protocols are you familiar with?

Explain "default gateway"

A default gateway serves as an access point or IP router that a networked computer uses to send information to a computer in another network or the internet.

What is ARP? How does it works?

ARP stands for Address Resolution Protocol. When you try to ping an IP address on your local network, say 192.168.1.1, your system has to turn the IP address 192.168.1.1 into a MAC address. This involves using ARP to resolve the address, hence its name.

Systems keep an ARP look-up table where they store information about what IP addresses are associated with what MAC addresses. When trying to send a packet to an IP address, the system will first consult this table to see if it already knows the MAC address. If there is a value cached, ARP is not used.

What is TTL? What does it helps to prevent?

What is DHCP? How does it works?

It stands for Dynamic Host Configuration Protocol, and allocates IP addresses, subnet masks and gateways to hosts. This is how it works:

A host upon entering a network, broadcasts a message in search of a DHCP server (DHCP DISCOVER)

An offer message is sent back by the DHCP server as a packet containing lease time, subnet mask, IP addresses, etc (DHCP OFFER)

Depending on which offer accepted, the client sends back a reply broadcast letting all DHCP servers know (DHCP REQUEST)

Server sends an acknowledgment (DHCP ACK)

Read more here

Can you have two DHCP servers in the same network? How it works?

What is SSL tunneling? How does it works?

What is a socket? Where can you see the list of sockets in your system?

What is IPv6? Why should we consider using it if we have IPv4?

What is VLAN?

What is MTU?

What happens if you send a packet that is bigger than the MTU?

True or False?. Ping is using UDP because it doesn't care about reliable connection

What is SDN?

What is ICMP? What is it used for?

What is NAT? How does it work?

NAT stands for network address translation. It’s a way to map multiple local private addresses to a public one before transferring the information. Organizations that want multiple devices to employ a single IP address use NAT, as do most home routers. For example, your computer's private IP could be 192.168.1.100, but your router maps the traffic to it's public IP (e.g. 1.1.1.1). Any device on the internet would see the traffic coming from your public IP (1.1.1.1) instead of your private IP (192.168.1.100).

Which factors affect network performances

Which port number is used in each of the following protocols?:

SSH
SMTP
HTTP
DNS
HTTPS
FTP
SFTP

SSH - 22

SMTP - 25

HTTP - 80

DNS - 53

HTTPS - 443

FTP - 21

SFTP - 22

Network - Data and Control planes

What "control plane" refers to?

The control plane is the part of the network that decides how to route and forward packets to a different location.

What "data plane" refers to?

The data plane is the part of the network that actually forwards the data/packets.

What "management plane" refers to?

Refers to monitoring and management functions.

To which plane (data, control, ...) is creating routing tables belongs to?

Control Plane.

Explain Spanning Tree Protocol (STP)

What is link aggregation? Why is it used?

What is Asymmetric Routing? How do deal with it?

What overlay (tunnel) protocols are you familiar with?

What is GRE? How does it works?

What is VXLAN? How does it works?

What is SNAT?

Explain OSPF

What is latency?

What is bandwidth?

What is throughput?

When performing a search query, what is more important, latency or throughput? And how to assure that what managing global infrastructure?

Latency. To have a good latency, a search query should be forwarded to the closest datacenter.

When uploading a video, what is more important, latency or throughput? And how to assure that?

Throughput. To have a good throughput, the upload stream should be routed to an underutilized link.

What other considerations (except latency and throughput) are there when forwarding requests?

Keep caches updated (which means the request could be forwarded not to the closest datacenter)

Explain Spine & Leaf

What is Network Congestion? What can cause it?

What can you tell me about UDP packet format? What about TCP packet format? How is it different?

What is the exponential backoff algorithm? Where is it used?

Using Hamming code, what would be the code word for the following data word 100111010001101?

00110011110100011101

Give examples of protocols found in the application layer

Hypertext Transfer Protocol (HTTP) - used for the webpages on the internet

Simple Mail Transfer Protocol (SMTP) - email transmission

Telecommunications Network - (TELNET) - terminal emulation to allow client access to telnet server

File Transfer Protocol (FTP) - facilitates transfer of files between any two machines

Domain Name System (DNS) - domain name translation

Dynamic Host Configuration Protocol (DHCP) - allocates IP addresses, subnet masks and gateways to hosts

Simple Network Management Protocol (SNMP) - gathers data of devices on the network

Give examples of protocols found in the network Layer

Internet Protocol (IP) - assists in routing packets from one machine to another

Internet Control Message Protocol (ICMP) - lets one know what is going such as error messages and debugging information

What is HSTS?

HTTP Strict Transport Security is a web server directive that informs user agents and web browsers how to handle its connection through a response header sent at the very beginning and back to the browser. This forces connections over HTTPS encryption, disregarding any script's call to load any resource in that domain over HTTP.

What is the difference if any between SSL and TLS?

Network - Misc

What is the Internet? Is it the same as the World Wide Web?

The internet refers to network of networks, transferring huge amounts of data around the globe.
The World Wide Web is an application running on millions of server, on top of the internet, accessed through what is know as the web browser

What is the ISP?

ISP (Internet Service Provider) is the local internet company provider.

Linux

Linux Master Application

A completely free application for testing your knowledge on Linux

Linux Self Assessment

What is your experience with Linux?

Only you know :)

For example:

Administration

Troubleshooting & Debugging

Storage

Networking

Development

Deployments

Explain what each of the following commands does and give an example on how to use it:

touch
ls
rm
cat
cp
mkdir
pwd
cd

touch - update file's timestamp. More commonly used for creating files

ls - listing files and directories

rm - remove files and directories

cat - create, view and concatenate files

cp - copy files and directories

mkdir - create directories

pwd - print current working directory (= at what path the user currently located)

cd - change directory

What each of the following commands does?

cd /
cd ~
cd
cd ..
cd .
cd -

cd / -> change to the root directory

cd ~ -> change to your home directory

cd -> change to your home directory

cd .. -> change to the directory above your current i.e parent directory

cd . -> change to the directory you currently in

cd - -> change to the last visited path

Some of the commands in the previous question can be run with the -r/-R flag. What does it do? Give an example to when you would use it

The -r (or -R in some commands) flag allows the user to run a certain command recursively. For example, listing all the files under the following tree is possible when done recursively (ls -R):

/dir1/ dir2/ file1 file2 dir3/ file3

To list all the files, one can run ls -R /dir1

Explain each field in the output of `ls -l` command

It shows a detailed list of files in a long format. From the left:

file permissions, number of links, owner name, owner group, file size, timestamp of last modification and directory/file name

What are hidden files/directories? How to list them?

These are files directly not displayed after performing a standard ls direct listing. An example of these files are .bashrc which are used to execute some scripts. Some also store configuration about services on your host like .KUBECONFIG. The command used to list them is, ls -a

What do > and < do in terms of input and output for programs?

They take in input (<) and output for a given file (>) using stdin and stdout.

myProgram < input.txt > executionOutput.txt

Explain what each of the following commands does and give an example on how to use it:

sed
grep
cut
awk

sed: a stream editor. Can be used for various purposes like replacing a word in a file: sed -i s/salad/burger/g

grep: a search tool. Used to search, count or match a text in a file:

searching for any line that contains a word in a file: grep 'word' file.md

or displaying the total number of times a string appears in a file: grep -c 'This is a string' file.md

cut: a tool for cutting out selected portions of each line of a file:
- syntax: cut OPTION [FILE]
  - cutting first two bytes from a word in a file: cut -b 1-2 file.md, output: wo

How to rename the name of a file or a directory?

Using the mv command.

Specify which command would you use (and how) for each of the following scenarios

Remove a directory with files
Display the content of a file
Provides access to the file /tmp/x for everyone
Change working directory to user home directory
Replace every occurrence of the word "good" with "great" in the file /tmp/y

rm -rf dir

cat or less

chmod 777 /tmp/x

cd ~

sed -i s/good/great/g /tmp/y

How can you check what is the path of a certain command?

whereis

which

What is the difference between these two commands? Will it result in the same output?

echo hello world
echo "hello world"

The echo command receives two separate arguments in the first execution and in the second execution it gets one argument which is the string "hello world". The output will be the same.

Explain piping. How do you perform piping?

Using a pipe in Linux, allows you to send the output of one command to the input of another command. For example: cat /etc/services | wc -l

Fix the following commands:

sed "s/1/2/g' /tmp/myFile
find . -iname *.yaml -exec sed -i "s/1/2/g" {} ;

sed 's/1/2/g' /tmp/myFile # sed "s/1/2/g" is also fine find . -iname "*.yaml" -exec sed -i "s/1/2/g" {} \;

How to check which commands you executed in the past?

history command or .bash_history file

Running the command df you get "command not found". What could be wrong and how to fix it?

Most likely the default/generated $PATH was somehow modified or overridden thus not containing /bin/ where df would normally go. This issue could also happen if bash_profile or any configuration file of your interpreter was wrongly modified, causing erratics behaviours. You would solve this by fixing your $PATH variable:

As to fix it there are several options:

Manually adding what you need to your $PATH PATH="$PATH":/user/bin:/..etc

You have your weird env variables backed up.

You would look for your distro default $PATH variable, copy paste using method #1

Note: There are many ways of getting errors like this: if bash_profile or any configuration file of your interpreter was wrongly modified; causing erratics behaviours, permissions issues, bad compiled software (if you compiled it by yourself)... there is no answer that will be true 100% of the time.

How do you schedule tasks periodically?

You can use the commands cron and at. With cron, tasks are scheduled using the following format:

*/30 * * * * bash myscript.sh Executes the script every 30 minutes.

The tasks are stored in a cron file, you can write in it using crontab -e

Alternatively if you are using a distro with systemd it's recommended to use systemd timers.

Linux - I/O Redirection

Explain Linux I/O redirection

Demonstrate Linux output redirection

ls > ls_output.txt

Demonstrate Linux stderr output redirection

yippiekaiyay 2> ls_output.txt

Demonstrate Linux stderr to stdout redirection

yippiekaiyay 1>&2

What is the result of running the following command? yippiekaiyay 1>&2 die_hard

An output similar to: yippikaiyay: command not found...
The file die_hard will not be created

Linux FHS

In Linux FHS (Filesystem Hierarchy Standard) what is the /?

The root of the filesystem. The beginning of the tree.

What is stored in each of the following paths?

/bin, /sbin, /usr/bin and /usr/sbin
/etc
/home
/var
/tmp

binaries

configuration files

home directories of the different users

files that tend to change and be modified like logs

temporary files

What is special about the /tmp directory when compared to other directories?

/tmp folder is cleaned automatically, usually upon reboot.

What kind of information one can find in /proc?

Can you create files in /proc?

In which path can you find the system devices (e.g. block storage)?

Linux - Permissions

How to change the permissions of a file?

Using the chmod command.

What does the following permissions mean?:

777 - You give the owner, group and other: Execute (1), Write (2) and Read (4); 4+2+1 = 7. 644 - Owner has Read (4), Write (2), 4+2 = 6; Group and Other have Read (4). 750 - Owner has x+r+w, Group has Read (4) and Execute (1); 4+1 = 5. Other have no permissions.

What this command does? chmod +x some_file

It adds execute permissions to all sets i.e user, group and others

Explain what is setgid and setuid

setuid is a linux file permission that permits a user to run a file or program with the permissions of the owner of that file. This is possible by elevation of current user privileges.

setgid is a process when executed will run as the group that owns the file.

What is the purpose of sticky bit?

Its a bit that only allows the owner or the root user to delete or modify the file.

What the following commands do?

chmod
chown
chgrp

chmod - changes access permissions to files system objects

chown - changes the owner of file system files and directories

chgrp - changes the group associated with a file system object

What is sudo? How do you set it up?

True or False? In order to install packages on the system one must be the root user or use the sudo command

True

Explain what are ACLs. For what use cases would you recommend to use them?

You try to create a file but it fails. Name at least three different reason as to why it could happen

No more disk space

No more inodes

No permissions

A user accidentally executed the following chmod -x $(which chmod). How to fix it?

Linux - systemd

What is systemd?

Systemd is a daemon (System 'd', d stands for daemon).
A daemon is a program that runs in the background without direct control of the user, although the user can at any time talk to the daemon.

systemd has many features such as user processes control/tracking, snapshot support, inhibitor locks..

If we visualize the unix/linux system in layers, systemd would fall directly after the linux kernel.
Hardware -> Kernel -> Daemons, System Libraries, Server Display.

How to start or stop a service?

To start a service: systemctl start <service name> To stop a service: systemctl stop <service name>

How to check the status of a service?

systemctl status <service name>

On a system which uses systemd, how would you display the logs?

journalctl

Describe how to make a certain process/app a service

Linux - Troubleshooting & Debugging

Where system logs are located?

/var/log

How to follow file's content as it being appended without opening the file every time?

tail -f <file_name>

What are you using for troubleshooting and debugging network issues?

dstat -t is great for identifying network and disk issues. netstat -tnlaup can be used to see which processes are running on which ports. lsof -i -P can be used for the same purpose as netstat. ngrep -d any metafilter for matching regex against payloads of packets. tcpdump for capturing packets wireshark same concept as tcpdump but with GUI (optional).

What are you using for troubleshooting and debugging disk & file system issues?

dstat -t is great for identifying network and disk issues. opensnoop can be used to see which files are being opened on the system (in real time).

What are you using for troubleshooting and debugging process issues?

strace is great for understanding what your program does. It prints every system call your program executed.

What are you using for debugging CPU related issues?

top will show you how much CPU percentage each process consumes perf is a great choice for sampling profiler and in general, figuring out what your CPU cycles are "wasted" on flamegraphs is great for CPU consumption visualization (http://www.brendangregg.com/flamegraphs.html)

You get a call from someone claiming "my system is SLOW". What do you do?

Check with top for anything unusual

Run dstat -t to check if it's related to disk or network.

Check if it's network related with sar

Check I/O stats with iostat

Explain iostat output

How to debug binaries?

What is the difference between CPU load and utilization?

How you measure time execution of a program?

Linux Kernel

What is a kernel, and what does it do?

The kernel is part of the operating system and is responsible for tasks like:

Allocating memory

Schedule processes

Control CPU

How do you find out which Kernel version your system is using?

uname -a command

What is a Linux kernel module and how do you load a new module?

Explain user space vs. kernel space

The operating system executes the kernel in protected memory to prevent anyone from changing (and risking it crashing). This is what is known as "Kernel space". "User space" is where users executes their commands or applications. It's important to create this separation since we can't rely on user applications to not tamper with the kernel, causing it to crash.

Applications can access system resources and indirectly the kernel space by making what is called "system calls".

Linux - SSH

What is SSH? How to check if a Linux server is running SSH?

Wikipedia Definition: "SSH or Secure Shell is a cryptographic network protocol for operating network services securely over an unsecured network."

Hostinger.com Definition: "SSH, or Secure Shell, is a remote administration protocol that allows users to control and modify their remote servers over the Internet."

An SSH server will have SSH daemon running. Depends on the distribution, you should be able to check whether the service is running (e.g. systemctl status sshd).

Why SSH is considered better than telnet?

Telnet also allows you to connect to a remote host but as opposed to SSH where the communication is encrypted, in telnet, the data is sent in clear text, so it doesn't considered to be secured because anyone on the network can see what exactly is sent, including passwords.

What is stored in ~/.ssh/known_hosts?

You try to ssh to a server and you get "Host key verification failed". What does it mean?

It means that the key of the remote host was changed and doesn't match the one that stored on the machine (in ~/.ssh/known_hosts).

What is the difference between SSH and SSL?

What ssh-keygen is used for?

What is SSH port forwarding?

Linux - Globbing, Wildcards

What is Globbing?

What are wildcards? Can you give an example of how to use them?

Explain what will ls [XYZ] match

Explain what will ls [^XYZ] match

Explain what will ls [0-5] match

What each of the following matches

The ? matches any single character

The * matches zero or more characters

What do we grep for in each of the following commands?:

grep '[0-9]{1,3}.[0-9]{1,3}.[0-9]{1,3}.[0-9]{1,3}' some_file
grep -E "error|failure" some_file
grep '[0-9]$' some_file

An IP address

The word "error" or "failure"

Lines which end with a number

Which line numbers will be printed when running `grep '\baaa\b'` on the following content:

aaa bbb ccc.aaa aaaaaa

lines 1 and 3.

What is the difference single and double quotes?

What is escaping? What escape character is used for escaping?

What is an exit code? What exit codes are you familiar with?

An exit code (or return code) represents the code returned by a child process to its parent process.

0 is an exit code which represents success while anything higher than 1 represents error. Each number has different meaning, based on how the application was developed.

I consider this as a good blog post to read more about it: https://shapeshed.com/unix-exit-codes

Linux Boot Process

Tell me everything you know about the Linux boot process

Another way to ask this: what happens from the moment you turned on the server until you get a prompt

What is GRUB2?

What is Secure Boot?

What can you find in /boot?

Linux Disk & Filesystem

What's an inode?

For each file (and directory) in Linux there is an inode, a data structure which stores meta data related to the file like its size, owner, permissions, etc.

Which of the following is not included in inode:

Link count
File size
File name
File timestamp

File name (it's part of the directory file)

How to check which disks are currently mounted?

Run mount

You run the mount command but you get no output. How would you check what mounts you have on your system?

cat /proc/mounts

What is the difference between a soft link and hard link?

Hard link is the same file, using the same inode. Soft link is a shortcut to another file, using a different inode.

True or False? You can create an hard link for a directory

False

True or False? You can create a soft link between different filesystems

True

True or False? Directories always have by minimum 2 links

True.

What happens when you delete the original file in case of soft link and hard link?

Can you check what type of filesystem is used in /home?

There are many answers for this question. One way is running df -T

What is a swap partition? What is it used for?

How to create a

new empty file
a file with text (without using text editor)
a file with given size

You are trying to create a new file but you get "File system is full". You check with df for free space and you see you used only 20% of the space. What could be the problem?

How would you check what is the size of a certain directory?

du -sh

What is LVM?

Explain the following in regards to LVM:

What is NFS? What is it used for?

What RAID is used for? Can you explain the differences between RAID 0, 1, 5 and 10?

Describe the process of extending a filesystem disk space

What is lazy umount?

What is tmpfs?

What is stored in each of the following logs?

/var/log/messages
/var/log/boot.log

True or False? both /tmp and /var/tmp cleared upon system boot

False. /tmp is cleared upon system boot while /var/tmp is cleared every a couple of days or not cleared at all (depends on distro).

Linux Performance Analysis

How to check what is the current load average?

One can use uptime or top

You know how to see the load average, great. but what each part of it means? for example 1.43, 2.34, 2.78

This article summarizes the load average topic in a great way

How to check process usage?

pidstat

How to check disk I/O?

iostat -xz 1

How to check how much free memory a system has? How to check memory consumption by each process?

You can use the commands top and free

How to check TCP stats?

sar -n TCP,ETCP 1

Linux Processes

how to list all the processes running in your system?

ps -ef

How to run a process in the background and why to do that in the first place?

You can achieve that by specifying & at the end of the command. As to why, since some commands/processes can take a lot of time to finish execution or run forever, you may want to run them in the background instead of waiting for them to finish before gaining control again in current session.

How can you find how much memory a specific process consumes?

mem() { ps -eo rss,pid,euser,args:100 --sort %mem | grep -v grep | grep -i $@ | awk '{printf $1/1024 "MB"; $1=""; print }' } [Source](https://stackoverflow.com/questions/3853655/in-linux-how-to-tell-how-much-memory-processes-are-using)

What signal is used by default when you run 'kill *process id*'?

The default signal is SIGTERM (15). This signal kills process gracefully which means it allows it to save current state configuration.

What signals are you familiar with?

SIGTERM - default signal for terminating a process SIGHUP - common usage is for reloading configuration SIGKILL - a signal which cannot caught or ignored

To view all available signals run kill -l

What kill 0 does?

What kill -0 does?

What is a trap?

Every couple of days, a certain process stops running. How can you look into why it's happening?

What happens when you press ctrl + c?

What is a Daemon in Linux?

A background process. Most of these processes are waiting for requests or set of conditions to be met before actually running anything. Some examples: sshd, crond, rpcbind.

What are the possible states of a process in Linux?

Running (R) Uninterruptible Sleep (D) - The process is waiting for I/O Interruptible Sleep (S) Stopped (T) Dead (x) Zombie (z)

How do you kill a process in D state?

What is a zombie process?

A process which has finished to run but has not exited.

One reason it happens is when a parent process is programmed incorrectly. Every parent process should execute wait() to get the exit code from the child process which finished to run. But when the parent isn't checking for the child exit code, the child process can still exists although it finished to run.

How to get rid of zombie processes?

You can't kill a zombie process the regular way with kill -9 for example as it's already dead.

One way to kill zombie process is by sending SIGCHLD to the parent process telling it to terminate its child processes. This might not work if the parent process wasn't programmed properly. The invocation is kill -s SIGCHLD [parent_pid]

You can also try closing/terminating the parent process. This will make the zombie process a child of init (1) which does periodic cleanups and will at some point clean up the zombie process.

How to find all the

Processes executed/owned by a certain user
Process which are Java processes
Zombie Processes

If you mention at any point ps command with arugments, be familiar with what these arguments does exactly.

What is the init process?

It is the first process executed by the kernel during the booting of a system. It is a daemon process which runs till the system is shutdown. That is why, it is the parent of all the processes

Can you describe how processes are being created?

How to change the priority of a process? Why would you want to do that?

Can you explain how network process/connection is established and how it's terminated?>

What strace does? What about ltrace?

Find all the files which end with '.yml' and replace the number 1 in 2 in each file

find /some_dir -iname *.yml -print0 | xargs -0 -r sed -i "s/1/2/g"

You run ls and you get "/lib/ld-linux-armhf.so.3 no such file or directory". What is the problem?

The ls executable is built for an incompatible architecture.

How would you split a 50 lines file into 2 files of 25 lines each?

You can use the split command this way: split -l 25 some_file

What is a file descriptor? What file descriptors are you familiar with?

Kerberos File descriptor, also known as file handler, is a unique number which identifies an open file in the operating system.
In Linux (and Unix) the first three file descriptors are:

0 - the default data stream for input

1 - the default data stream for output

2 - the default data stream for output related to errors

This is a great article on the topic: https://www.computerhope.com/jargon/f/file-descriptor.htm

What is NTP? What is it used for?

Explain Kernel OOM

Linux Security

What is chroot? In what scenarios would you consider using it?

What is SELiunx?

What is Kerberos?

What is nftables?

What firewalld daemon is responsible for?

Do you have experience with hardening servers? Can you describe the process?

Linux - Networking

How to list all the interfaces?

ip link show

What is the loopback (lo) interface?

The loopback interface is a special, virtual network interface that your computer uses to communicate with itself. It is used mainly for diagnostics and troubleshooting, and to connect to servers running on the local machine.

What the following commands are used for?

ip addr
ip route
ip link
ping
netstat
traceroute

What is a network namespace? What is it used for?

How to check if a certain port is being used?

One of the following would work:

netstat -tnlp | grep <port_number> lsof -i -n -P | grep <port_number>

How can you turn your Linux server into a router?

What is a virtual IP? In what situation would you use it?

True or False? The MAC address of an interface is assigned/set by the OS

False

Can you have more than one default gateway in a given system?

Technically, yes.

What is telnet and why is it a bad idea to use it in production? (or at all)

Telnet is a type of client-server protocol that can be used to open a command line on a remote computer, typically a server. By default, all the data sent and received via telnet is transmitted in clear/plain text, therefore it should not be used as it does not encrypt any data between the client and the server.

What is the routing table? How do you view it?

How can you send an HTTP request from your shell?

Using nc is one way

What are packet sniffers? Have you used one in the past? If yes, which packet sniffers have you used and for what purpose?

It is a network utility that analyses and may inject tasks into the data-stream travelling over the targeted network.

How to list active connections?

How to trigger neighbor discovery in IPv6?

One way would be ping6 ff02::1

What is network interface bonding and do you know how it's performed in Linux?

What network bonding modes are there?

There a couple of modes:

balance-rr: round robing bonding

active-backup: a fault tolerance mode where only one is active

balance-tlb: Adaptive transmit load balancing

balance-alb: Adaptive load balancing

What is a bridge? How it's added in Linux OS?

Linux DNS

How to check what is the hostname of the system?

cat /etc/hostname

You can also run hostnamectl or hostname but that might print only a temporary hostname. The one in the file is the permanent one.

What the file /etc/resolv.conf is used for? What does it include?

What commands are you using for performing DNS queries (or troubleshoot DNS related issues)?

You can specify one or more of the following:

dig

host

nslookup

Linux - Packaging

Do you have experience with packaging? (as in building packages) Can you explain how does it works?

How packages installation/removal is performed on the distribution you are using?

The answer depends on the distribution being used.

In Fedora/CentOS/RHEL/Rocky it can be done with rpm or dnf commands. In Ubuntu it can be done with the apt command.

RPM: explain the spec format (what it should and can include)

How do you list the content of a package without actually installing it?

How to know to which package a file on the system belongs to? Is it a problem if it doesn't belongs to any package?

Where repositories are stored? (based on the distribution you are using)

What is an archive? How do you create one in Linux?

How to extract the content of an archive?

Why do we need package managers? Why not simply creating archives and publish them?

Package managers allow you to manage packages lifecycle as in installing, removing and updating the packages.
In addition, you can specify in a spec how a certain package will be installed - where to copy the files, which commands to run prior to the installation, post the installation, etc.

Linux DNF

What is DNF?

From the repo:

"Dandified YUM (DNF) is the next upcoming major version of YUM. It does package management using RPM, libsolv and hawkey libraries."

Official docs

How to look for a package that provides the command /usr/bin/git? (the package isn't necessarily installed)

dnf provides /usr/bin/git

Linux Applications and Services

What can you find in /etc/services?

How to make sure a Service starts automatically after a reboot or crash?

Depends on the init system.

Systemd: systemctl enable [service_name] System V: update-rc.d [service_name] and add this line id:5678:respawn:/bin/sh /path/to/app to /etc/inittab Upstart: add Upstart init script at /etc/init/service.conf

You run ssh 127.0.0.1 but it fails with "connection refused". What could be the problem?

SSH server is not installed

SSH server is not running

How to print the shared libraries required by a certain program? What is it useful for?

What is CUPS?

What types of web servers are you familiar with?

Nginx, Apache httpd.

Linux Users and Groups

What is a "superuser" (or root user)? How is it different from regular users?

How do you create users? Where user information is stored?

Which file stores information about groups?

How do you change/set the password of a user?

Which file stores users passwords? Is it visible for everyone?

Do you know how to create a new user without using adduser/useradd command?

What information is stored in /etc/passwd? explain each field

How to add a new user to the system without providing him the ability to log-in into the system?

adduser user_name --shell=/bin/false --no-create-home You can also add a user and then edit /etc/passwd.

How to switch to another user? How to switch to the root user?

su command. Use su - to switch to root

What is the UID the root user? What about a regular user?

What can you do if you lost/forogt the root password?

Re-install the OS IS NOT the right answer :)

What is /etc/skel?

How to see a list of who logged-in to the system?

Using the last command.

Explain what each of the following commands does:

useradd
usermod
whoami
id

Linux Hardware

Where can you find information on the processor?

/proc/cpuinfo

How can you print information on the BIOS, motherboard, processor and RAM?

dmidecoode

How can you print all the information on connected block devices in your system?

lsblk

True or False? In user space, applications don't have full access to hardware resources

True. Only in kernel space they have full access to hardware resources.

Linux - Security

How do you create a private key for a CA (certificate authority)?

One way is using openssl this way:

openssl genrsa -aes256 -out ca-private-key.pem 4096

How do you create a public key for a CA (certificate authority)?

openssl req -new -x509 -days 730 -key [private key file name] -sha256 -out ca.pem

If using the private key from the previous question then the command would be:

openssl req -new -x509 -days 730 -key ca-private-key.pem -sha256 -out ca.pem

Linux - Namespaces

What types of namespaces are there in Linux?

Process ID namespaces: these namespaces include independent set of process IDs

Mount namespaces: Isolation and control of mountpoints

Network namespaces: Isolates system networking resources such as routing table, interfaces, ARP table, etc.

UTS namespaces: Isolate host and domains

IPC namespaces: Isolates interprocess communications

User namespaces: Isolate user and group IDs

Time namespaces: Isolates time machine

True or False? In every PID (Process ID) namespace the first process assigned with the process id number 1

True. Inside the namespace it's PID 1 while to the parent namespace the PID is a different one.

True or False? In a child PID namespace all processes are aware of parent PID namespace and processes and the parent PID namespace has no visibility of child PID namespace processes

False. The opposite is true. Parent PID namespace is aware and has visibility of processes in child PID namespace and child PID namespace has no visibility as to what is going on in the parent PID namespace.

True or False? By default, when creating two separate network namespaces, a ping from one namespace to another will work fine

False. Network namespace has its own interfaces and routing table. There is no way (without creating a bridge for example) for one network namespace to reach another.

True or False? With UTS namespaces, processes may appear as if they run on different hosts and domains while running on the same host

True

True or False? It's not possible to have a root user with ID 0 in child user namespaces

False. In every child user namespace, it's possible to have a separate root user with uid of 0.

What time namespaces are used for?

In time namespaces processes can use different system time.

Linux - Virtualization

What virtualization solutions are available for Linux?

KVM

XEN

VirtualBox

Linux-VServer

User-mode Linux

...

What is KVM?

Is an open source virtualization technology used to operate on x86 hardware.

From the official docs Recommended read:

Red Hat Article - What is KVM?

What is Libvirt?

It's an open source collection of software used to manage virtual machines. It can be used with: KVM, Xen, LXC and others. It's also called Libvirt Virtualization API.

From the official docs Hypervisor supported docs

Linux - AWK

What the awk command does? Have you used it? What for?

From Wikipedia: "AWK is domain-specific language designed for text processing and typically used as a data extraction and reporting tool"

How to print the 4th column in a file?

awk '{print $4}' file

How to print every line that is longer than 79 characters?

awk 'length($0) > 79' file

What the lsof command does? Have you used it? What for?

What is the difference between find and locate?

How a user process performs a privileged operation, such as reading from the disk?

Using system calls

Linux - System Calls

What is a system call? What system calls are you familiar with?

How a program executes a system call?

A program executes a trap instruction. The instruction jump into the kernel while raising the privileged level to kernel space.

Once in kernel space, it can perform any privileged operation

Once it's finished, it calls a "return-from-trap" instruction which returns to user space while reducing back the privilege level to user space.

Explain the fork() system call

fork() is used for creating a new process. It does so by cloning the calling process but the child process has its own PID and any memory locks, I/O operations and semaphores are not inherited.

What is the return value of fork()?

On success, the PID of the child process in parent and 0 in child process

On error, -1 in the parent

Name one reason for fork() to fail

Not enough memory to create a new process

Why do we need the wait() system call?

wait() is used by a parent process to wait for the child process to finish execution. If wait is not used by a parent process then a child process might become a zombie process.

How the kernel notifies the parent process about child process termination?

The kernel notifies the parent by sending the SIGCHLD to the parent.

How the waitpid() is different from wait()?

The waitpid() is a non-blocking version of the wait() function.
It also supports using library routine (e.g. system()) to wait a child process without messing up with other children processes for which the process has not waited.

True or False? The wait() system call won't return until the child process has run and exited

True in most cases though there are cases where wait() returns before the child exits.

Explain the exec() system call

It transforms the current running program into another program.
Given the name of an executable and some arguments, it loads the code and static data from the specified executable and overwrites its current code segment and current static code data. After initializing its memory space (like stack and heap) the OS runs the program passing any arguments as the argv of that process.

True or False? A successful call to exec() never returns

True
Since a succesful exec replace the current process, it can't return anything to the process that made the call.

What system call is used for listing files?

What system calls are used for creating a new process?

fork(), exec() and the wait() system call is also included in this workflow.

What execve() does?

Executes a program. The program is passed as a filename (or path) and must be a binary executable or a script.

What is the return value of malloc?

Explain the pipe() system call. What does it used for?

Unix pipe implementation

"Pipes provide a unidirectional interprocess communication channel. A pipe has a read end and a write end. Data written to the write end of a pipe can be read from the read end of the pipe. A pipe is created using pipe(2), which returns two file descriptors, one referring to the read end of the pipe, the other referring to the write end."

What happens when you execute ls -l?

Shell reads the input using getline() which reads the input file stream and stores into a buffer as a string

The buffer is broken down into tokens and stored in an array this way: {"ls", "-l", "NULL"}

Shell checks if an expansion is required (in case of ls *.c)

Once the program in memory, its execution starts. First by calling readdir()

Notes:

getline() originates in GNU C library and used to read lines from input stream and stores those lines in the buffer

What happens when you execute ls -l *.log?

What readdir() system call does?

What exactly the command alias x=y does?

Why running a new program is done using the fork() and exec() system calls? why a different API wasn't developed where there is one call to run a new program?

This way provides a lot of flexibility. It allows the shell for example, to run code after the call to fork() but before the call to exec(). Such code can be used to alter the environment of the program it about to run.

Describe shortly what happens when you execute a command in the shell

The shell figures out, using the PATH variable, where the executable of the command resides in the filesystem. It then calls fork() to create a new child process for running the command. Once the fork was executed successfully, it calls a variant of exec() to execute the command and finally, waits the command to finish using wait(). When the child completes, the shell returns from wait() and prints out the prompt again.

Linux Filesystem & Files

How to create a file of a certain size?

There are a couple of ways to do that:

dd if=/dev/urandom of=new_file.txt bs=2MB count=1

truncate -s 2M new_file.txt

fallocate -l 2097152 new_file.txt

What does the following block do?:

open("/my/file") = 5
read(5, "file content")

These system calls are reading the file /my/file and 5 is the file descriptor number.

Describe three different ways to remove a file (or its content)

What is the difference between a process and a thread?

What is context switch?

From wikipedia: a context switch is the process of storing the state of a process or thread, so that it can be restored and resume execution at a later point

You found there is a server with high CPU load but you didn't find a process with high CPU. How is that possible?

Linux Advanced - Networking

When you run ip a you see there is a device called 'lo'. What is it and why do we need it?

What the traceroute command does? How does it works?

Another common way to task this questions is "what part of the tcp header does traceroute modify?"

What is network bonding? What types are you familiar with?

How to link two separate network namespaces so you can ping an interface on one namespace from the second one?

What are cgroups?

Explain Process Descriptor and Task Structure

What are the differences between threads and processes?

Explain Kernel Threads

What happens when socket system call is used?

This is a good article about the topic: https://ops.tips/blog/how-linux-creates-sockets

You executed a script and while still running, it got accidentally removed. Is it possible to restore the script while it's still running?

Linux Memory

What is the difference between MemFree and MemAvailable in /proc/meminfo?

MemFree - The amount of unused physical RAM in your system MemAvailable - The amount of available memory for new workloads (without pushing system to use swap) based on MemFree, Active(file), Inactive(file), and SReclaimable.

What is the difference between paging and swapping?

Explain what is OOM killer

Distribution

What is a Linux distribution?

What Linux distributions are you familiar with?

What are the components of a Linux distribution?

Kernel

Utilities

Services

Software/Packages Management

Linux - Sed

Using sed, extract the date from the following line: 201.7.19.90 - - [05/Jun/1985:13:42:99 +0000] "GET /site HTTP/1.1" 200 32421

echo $line | sed 's/.*\[//g;s/].*//g;s/:.*//g'

Linux - Misc

Name 5 commands which are two letters long

ls, wc, dd, df, du, ps, ip, cp, cd ...

What ways are there for creating a new empty file?

touch new_file

echo "" > new_file

How `cd -` works? How does it knows the previous location?

$OLDPWD

List three ways to print all the files in the current directory

ls

find .

echo *

How to count the number of lines in a file? What about words?

You define x=2 in /etc/bashrc and x=6 ~/.bashrc you then login to the system. What would be the value of x?

What is the difference between man and info?

A good answer can be found here

Explain "environment variables". How do you list all environment variables?

What is a TTY device?

How to create your own environment variables?

X=2 for example. But this will persist to new shells. To have it in new shells as well, use export X=2

What a double dash (--) mean?

It's used in commands to mark the end of commands options. One common example is when used with git to discard local changes: git checkout -- some_file

Wildcards are implemented on user or kernel space?

If I plug a new device into a Linux machine, where on the system, a new device entry/file will be created?

/dev

Why there are different sections in man? What is the difference between the sections?

What is User-mode Linux?

Under which license Linux is distributed?

GPL v2

Operating System

Operating System Exercises

Name	Topic	Objective & Instructions	Solution	Comments
Fork 101	Fork	Link	Link
Fork 102	Fork	Link	Link

Operating System - Self Assessment

What is an operating system?

From the book "Operating Systems: Three Easy Pieces":

"responsible for making it easy to run programs (even allowing you to seemingly run many at the same time), allowing programs to share memory, enabling programs to interact with devices, and other fun stuff like that".

Operating System - Process

Can you explain what is a process?

A process is a running program. A program is one or more instructions and the program (or process) is executed by the operating system.

If you had to design an API for processes in an operating system, what would this API look like?

It would support the following:

Create - allow to create new processes

Delete - allow to remove/destroy processes

State - allow to check the state of the process, whether it's running, stopped, waiting, etc.

Stop - allow to stop a running process

How a process is created?

The OS is reading program's code and any additional relevant data

Program's code is loaded into the memory or more specifically, into the address space of the process.

Memory is allocated for program's stack (aka run-time stack). The stack also initialized by the OS with data like argv, argc and parameters to main()

Memory is allocated for program's heap which is required for dynamically allocated data like the data structures linked lists and hash tables

I/O initialization tasks are performed, like in Unix/Linux based systems where each process has 3 file descriptors (input, output and error)

OS is running the program, starting from main()

True or False? The loading of the program into the memory is done eagerly (all at once)

False. It was true in the past but today's operating systems perform lazy loading which means only the relevant pieces required for the process to run are loaded first.

What are different states of a process?

Running - it's executing instructions

Ready - it's ready to run but for different reasons it's on hold

Blocked - it's waiting for some operation to complete. For example I/O disk request

What are some reasons for a process to become blocked?

I/O operations (e.g. Reading from a disk)

Waiting for a packet from a network

What is Inter Process Communication (IPC)?

What is "time sharing"?

Even when using a system with one physical CPU, it's possible to allow multiple users to work on it and run programs. This is possible with time sharing where computing resources are shared in a way it seems to the user the system has multiple CPUs but in fact it's simply one CPU shared by applying multiprogramming and multi-tasking.

What is "space sharing"?

Somewhat the opposite of time sharing. While in time sharing a resource is used for a while by one entity and then the same resource can be used by another resource, in space sharing the space is shared by multiple entities but in a way where it's not being transferred between them.
It's used by one entity until this entity decides to get rid of it. Take for example storage. In storage, a file is yours until you decide to delete it.

What component determines which process runs at a given moment in time?

CPU scheduler

Operating System - Memory

What is "virtual memory" and what purpose it serves?

What is demand paging?

What is copy-on-write or shadowing?

What is a kernel, and what does it do?

The kernel is part of the operating system and is responsible for tasks like:

Allocating memory

Schedule processes

Control CPU

True or False? Some pieces of the code in the kernel are loaded into protected areas of the memory so applications can't overwritten them

True

What is POSIX?

Explain what is Semaphore and what its role in operating systems

What is cache? What is buffer?

Buffer: Reserved place in RAM which is used to hold data for temporary purposes Cache: Cache is usually used when processes reading and writing to the disk to make the process faster by making similar data used by different programs easily accessible.

Virtualization

What is Virtualization?

Virtualization uses software to create an abstraction layer over computer hardware that allows the hardware elements of a single computer—processors, memory, storage and more - to be divided into multiple virtual computers, commonly called virtual machines (VMs).

What is a hypervisor?

Red Hat: "A hypervisor is software that creates and runs virtual machines (VMs). A hypervisor, sometimes called a virtual machine monitor (VMM), isolates the hypervisor operating system and resources from the virtual machines and enables the creation and management of those VMs."

Read more here

What types of hypervisors are there?

Hosted hypervisors and bare-metal hypervisors.

What are the advantages and disadvantges of bare-metal hypervisor over a hosted hypervisor?

Due to having its own drivers and a direct access to hardware components, a baremetal hypervisor will often have better performances along with stability and scalability.

On the other hand, there will probably be some limitation regarding loading (any) drivers so a hosted hypervisor will usually benefit from having a better hardware compatibility.

What types of virtualization are there?

Operating system virtualization Network functions virtualization Desktop virtualization

Is containerization is a type of Virtualization?

Yes, it's a operating-system-level virtualization, where the kernel is shared and allows to use multiple isolated user-spaces instances.

How the introduction of virtual machines changed the industry and the way applications were deployed?

The introduction of virtual machines allowed companies to deploy multiple business applications on the same hardware while each application is separated from each other in secured way, where each is running on its own separate operating system.

Programming

What programming language do you prefer to use for DevOps related tasks? Why specifically this one?

What are static typed (or simply typed) languages?

In static typed languages the variable type is known at compile-time instead of at run-time. Such languages are: C, C++ and Java

Explain expressions and statements

An expression is anything that results in a value (even if the value is None). Basically, any sequence of literals so, you can say that a string, integer, list, ... are all expressions.

Statements are instructions executed by the interpreter like variable assignments, for loops and conditionals (if-else).

What is Object Oriented Programming? Why is it important?

Explain Composition

What is a compiler?

What is an interpreter?

Are you familiar with SOLID design principles?

SOLID design principles are about:

Make it easier to extend the functionality of the system

Make the code more readable and easier to maintain

SOLID is:

Single Responsibility - A class should only have a single responsibility

Open-Closed - An entity should be open for extension, but closed for modification. What this practically means is that you should extend functionality by adding a new code and not by modifying it. Your system should be separated into components so it can be easily extended without breaking everything.

Liskov Substitution - Any derived class should be able to substitute the its parent without altering its corrections. Practically, every part of the code will get the expected result no matter which part is using it

Interface segregation - A client should never depend on anything it doesn't uses

Dependency Inversion - High level modules should depend on abstractions, not low level modules

What is YAGNI? What is your opinion on it?

What is DRY? What is your opinion on it?

What are the four pillars of object oriented programming?

Explain recursion

Explain Inversion of Control

Explain Dependency Injection

True or False? In Dynamically typed languages the variable type is known at run-time instead of at compile-time

True

Explain what are design patterns and describe three of them in detail

Explain big O notation

What is "Duck Typing"?

Explain string interpolation

Common algorithms

Binary search:

How does it works?
Can you implement it? (in any language you prefer)
What is the average performance of the algorithm you wrote?

It's a search algorithm used with sorted arrays/lists to find a target value by dividing the array each iteration and comparing the middle value to the target value. If the middle value is smaller than target value, then the target value is searched in the right part of the divided array, else in the left side. This continues until the value is found (or the array divided max times)

python implementation

The average performance of the above algorithm is O(log n). Best performance can be O(1) and worst O(log n).

Code Review

What are your code-review best practices?

Do you agree/disagree with each of the following statements and why?:

The commit message is not important. When reviewing a change/patch one should focus on the actual change
You shouldn't test your code before submitting it. This is what CI/CD exists for.

Strings

In any language you want, write a function to determine if a given string is a palindrome

In any language you want, write a function to determine if two strings are Anagrams

Integers

In any language you would like, print the numbers from 1 to a given integer. For example for input: 5, the output is: 12345

Time Complexity

Describe what would be the time complexity of the operations access, search insert and remove for the following data structures:

Stack

Queue

Linked List

Binary Search Tree

What is the complexity for the best, worst and average cases of each of the following algorithms?:

Quick sort
Merge sort
Bucket Sort
Radix Sort

Data Structures & Types

Implement Stack in any language you would like

Tell me everything you know about Linked Lists

A linked list is a data structure

It consists of a collection of nodes. Together these nodes represent a sequence

Useful for use cases where you need to insert or remove an element from any position of the linked list

Some programming languages don't have linked lists as a built-in data type (like Python for example) but it can be easily implemented

Describe (no need to implement) how to detect a loop in a Linked List

There are multiple ways to detect a loop in a linked list. I'll mention three here:

Worst solution:
Two pointers where one points to the head and one points to the last node. Each time you advance the last pointer by one and check whether the distance between head pointer to the moved pointer is bigger than the last time you measured the same distance (if not, you have a loop).
The reason it's probably the worst solution, is because time complexity here is O(n^2)

Decent solution:

Create an hash table and start traversing the linked list. Every time you move, check whether the node you moved to is in the hash table. If it isn't, insert it to the hash table. If you do find at any point the node in the hash table, it means you have a loop. When you reach None/Null, it's the end and you can return "no loop" value. This one is very easy to implement (just create a hash table, update it and check whether the node is in the hash table every time you move to the next node) but since the auxiliary space is O(n) because you create a hash table then, it's not the best solution

Good solution:
Instead of creating a hash table to document which nodes in the linked list you have visited, as in the previous solution, you can modify the Linked List (or the Node to be precise) to have a "visited" attribute. Every time you visit a node, you set "visited" to True.
Time compleixty is O(n) and Auxiliary space is O(1), so it's a good solution but the only problem, is that you have to modify the Linked List.

Best solution:
You set two pointers to traverse the linked list from the beginning. You move one pointer by one each time and the other pointer by two. If at any point they meet, you have a loop. This solution is also called "Floyd's Cycle-Finding"
Time complexity is O(n) and auxiliary space is O(1). Perfect :)

Implement Hash table in any language you would like

What is Integer Overflow? How is it handled?

Name 3 design patterns. Do you know how to implement (= provide an example) these design pattern in any language you'll choose?

Given an array/list of integers, find 3 integers which are adding up to 0 (in any language you would like)

def find_triplets_sum_to_zero(li): li = sorted(li) for i, val in enumerate(li): low, up = 0, len(li)-1 while low < i < up: tmp = var + li[low] + li[up] if tmp > 0: up -= 1 elif tmp < 0: low += 1 else: yield li[low], val, li[up] low += 1 up -= 1

Python

Python Exercises

Name	Topic	Objective & Instructions	Solution
Identify the data type	Data Types	Exercise	Solution
Identify the data type - Advanced	Data Types	Exercise	Solution
Reverse String	Strings	Exercise	Solution
Compress String	Strings	Exercise	Solution

Python Self Assessment

What are some characteristics of the Python programming language?

1. It is a high level general purpose programming language created in 1991 by Guido Van Rosum. 2. The language is interpreted, being the CPython (Written in C) the most used/maintained implementation. 3. It is strongly typed. The typing discipline is duck typing and gradual. 4. Python focuses on readability and makes use of whitespaces/identation instead of brackets { } 5. The python package manager is called PIP "pip installs packages", having more than 200.000 available packages. 6. Python comes with pip installed and a big standard library that offers the programmer many precooked solutions. 7. In python **Everything** is an object.

What built-in types Python has?

List Dictionary Set Numbers (int, float, ...) String Bool Tuple Frozenset

What is mutability? Which of the built-in types in Python are mutable?

Mutability determines whether you can modify an object of specific type.

The mutable data types are:

List Dictionary Set
The immutable data types are:

Numbers (int, float, ...) String Bool Tuple Frozenset

Python - Booleans

What is the result of each of the following?

1 > 2
'b' > 'a'

1 == 'one'

2 > 'one'

False

True

False

TypeError

What is the result of `bool("")`? What about `bool(" ")`? Explain

bool("") -> evaluates to False
bool(" ") -> evaluates to True

What is the result of running [] is not []? explain the result

It evaluates to True.
The reason is that the two created empty list are different objects. x is y only evaluates to true when x and y are the same object.

What is the result of running True-True?

Python - Strings

True or False? String is an immutable data type in Python

True

How to check if a string starts with a letter?

Regex:

import re if re.match("^[a-zA-Z]+.*", string):
string built-in:

if string and string[0].isalpha():

How to check if all characters in a given string are digits?

string.isdigit

How to remove trailing slash ('/') from a string?

string.rstrip('/')

What is the result of of each of the following?

"abc"*3
"abc"*2.5
"abc"*2.0
"abc"*True
"abc"*False

abcabcabc

TypeError

TypeError

"abc"

Improve the following code:

char = input("Insert a character: ")
if char == "a" or char == "o" or char == "e" or char =="u" or char == "i":
    print("It's a vowel!")

char = input("Insert a character: ") # For readablity if lower(char[0]) in "aieou": # Takes care of multiple characters and separate cases print("It's a vowel!")
OR

if lower(input("Insert a character: ")[0]) in "aieou": # Takes care of multiple characters and small/Capital cases print("It's a vowel!")

Python - Functions

How to define a function with Python?

Using the `def` keyword. For Examples:
def sum(a, b): return (a + b)

In Python, functions are first-class objects. What does it mean?

In general, first class objects in programming languages are objects which can be assigned to variable, used as a return value and can be used as arguments or parameters.
In python you can treat functions this way. Let's say we have the following function

def my_function(): return 5

You can then assign a function to a variables like this x = my_function or you can return functions as return values like this return my_function

Python - Integer

Write a function to determine if a number is a Palindrome

Python - OOP

Explain inheritance and how to use it in Python

By definition inheritance is the mechanism where an object acts as a base of another object, retaining all its properties. So if Class B inherits from Class A, every characteristics from class A will be also available in class B. Class A would be the 'Base class' and B class would be the 'derived class'. This comes handy when you have several classes that share the same functionalities. The basic syntax is: class Base: pass class Derived(Base): pass A more forged example: class Animal: def __init__(self): print("and I'm alive!") def eat(self, food): print("ñom ñom ñom", food) class Human(Animal): def __init__(self, name): print('My name is ', name) super().__init__() def write_poem(self): print('Foo bar bar foo foo bar!') class Dog(Animal): def __init__(self, name): print('My name is', name) super().__init__() def bark(self): print('woof woof') michael = Human('Michael') michael.eat('Spam') michael.write_poem() bruno = Dog('Bruno') bruno.eat('bone') bruno.bark() >>> My name is Michael >>> and I'm alive! >>> ñom ñom ñom Spam >>> Foo bar bar foo foo bar! >>> My name is Bruno >>> and I'm alive! >>> ñom ñom ñom bone >>> woof woof Calling super() calls the Base method, thus, calling super().__init__() we called the Animal __init__. There is a more advanced python feature called MetaClasses that aid the programmer to directly control class creation.

Explain and demonstrate class attributes & instance attributes

In the following block of code x is a class attribute while self.y is a instance attribute

class MyClass(object): x = 1 def __init__(self, y): self.y = y

Python - Exceptions

What is an error? What is an exception? What types of exceptions are you familiar with?

# Note that you generally don't need to know the compiling process but knowing where everything comes from # and giving complete answers shows that you truly know what you are talking about. Generally, every compiling process have a two steps. - Analysis - Code Generation. Analysis can be broken into: 1. Lexical analysis (Tokenizes source code) 2. Syntactic analysis (Check whether the tokens are legal or not, tldr, if syntax is correct) for i in 'foo' ^ SyntaxError: invalid syntax We missed ':' 3. Semantic analysis (Contextual analysis, legal syntax can still trigger errors, did you try to divide by 0, hash a mutable object or use an undeclared function?) 1/0 ZeroDivisionError: division by zero These three analysis steps are the responsible for error handlings. The second step would be responsible for errors, mostly syntax errors, the most common error. The third step would be responsible for Exceptions. As we have seen, Exceptions are semantic errors, there are many builtin Exceptions: ImportError ValueError KeyError FileNotFoundError IndentationError IndexError ... You can also have user defined Exceptions that have to inherit from the `Exception` class, directly or indirectly. Basic example: class DividedBy2Error(Exception): def __init__(self, message): self.message = message def division(dividend,divisor): if divisor == 2: raise DividedBy2Error('I dont want you to divide by 2!') return dividend / divisor division(100, 2) >>> __main__.DividedBy2Error: I dont want you to divide by 2!

Explain Exception Handling and how to use it in Python

Exceptions: Errors detected during execution are called Exceptions.

Handling Exception: When an error occurs, or exception as we call it, Python will normally stop and generate an error message.
Exceptions can be handled using try and except statement in python.

Example: Following example asks the user for input until a valid integer has been entered.
If user enter a non-integer value it will raise exception and using except it will catch that exception and ask the user to enter valid integer again.

while True: try: a = int(input("please enter an integer value: ")) break except ValueError: print("Ops! Please enter a valid integer value.")
For more details about errors and exceptions follow this https://docs.python.org/3/tutorial/errors.html

What is the result of running the following function?

def true_or_false():
    try:
        return True
    finally:
        return False

False

Python Built-in functions

Explain the following built-in functions (their purpose + use case example):

repr
any
all

What is the difference between repr function and str?

What is the __call__ method?

Do classes has the __call__ method as well? What for?

What _ is used for in Python?

Translation lookup in i18n

Hold the result of the last executed expression or statement in the interactive interpreter.

As a general purpose "throwaway" variable name. For example: x, y, _ = get_data() (x and y are used but since we don't care about third variable, we "threw it away").

Explain what is GIL

What is Lambda? How is it used?

A lambda expression is an 'anonymous' function, the difference from a normal defined function using the keyword `def`` is the syntax and usage.

The syntax is:

lambda[parameters]: [expresion]

Examples:

A lambda function add 10 with any argument passed.

x = lambda a: a + 10 print(x(10))

An addition function

addition = lambda x, y: x + y print(addition(10, 20))

Squaring function

square = lambda x : x ** 2 print(square(5))
Generally it is considered a bad practice under PEP 8 to assign a lambda expresion, they are meant to be used as parameters and inside of other defined functions.

Properties

Are there private variables in Python? How would you make an attribute of a class, private?

Explain the following:

getter
setter
deleter

Explain what is @property

How do you swap values between two variables?

x, y = y, x

Explain the following object's magic variables:

dict

Write a function to return the sum of one or more numbers. The user will decide how many numbers to use

First you ask the user for the amount of numbers that will be use. Use a while loop that runs until amount_of_numbers becomes 0 through subtracting amount_of_numbers by one each loop. In the while loop you want ask the user for a number which will be added a variable each time the loop runs.

def return_sum(): amount_of_numbers = int(input("How many numbers? ")) total_sum = 0 while amount_of_numbers != 0: num = int(input("Input a number. ")) total_sum += num amount_of_numbers -= 1 return total_sum

Print the average of [2, 5, 6]. It should be rounded to 3 decimal places

li = [2, 5, 6] print("{0:.3f}".format(sum(li)/len(li)))

Python - Lists

What is a tuple in Python? What is it used for?

A tuple is a built-in data type in Python. It's used for storing multiple items in a single variable.

List, like a tuple, is also used for storing multiple items. What is then, the difference between a tuple and a list?

List, as opposed to a tuple, is a mutable data type. It means we can modify it and at items to it.

How to add the number 2 to the list x = [1, 2, 3]

x.append(2)

How to check how many items a list contains?

len(sone_list)

How to get the last element of a list?

some_list[-1]

How to add the items of [1, 2, 3] to the list [4, 5, 6]?

x = [4, 5, 6] x.extend([1, 2, 3])

Don't use append unless you would like the list as one item.

How to remove the first 3 items from a list?

my_list[0:3] = []

How do you get the maximum and minimum values from a list?

Maximum: max(some_list) Minimum: min(some_list)

How to get the top/biggest 3 items from a list?

sorted(some_list, reverse=True)[:3]
Or

some_list.sort(reverse=True) some_list[:3]

How to insert an item to the beginning of a list? What about two items?

How to sort list by the length of items?

sorted_li = sorted(li, key=len)
Or without creating a new list:

li.sort(key=len)

Do you know what is the difference between list.sort() and sorted(list)?

sorted(list) will return a new list (original list doesn't change)

list.sort() will return None but the list is change in-place

sorted() works on any iterable (Dictionaries, Strings, ...)

list.sort() is faster than sorted(list) in case of Lists

Convert every string to an integer: [['1', '2', '3'], ['4', '5', '6']]

nested_li = [['1', '2', '3'], ['4', '5', '6']] [[int(x) for x in li] for li in nested_li]

How to merge two sorted lists into one sorted list?

sorted(li1 + li2)
Another way:

i, j = 0 merged_li = [] while i < len(li1) and j < len(li2): if li1[i] < li2[j]: merged_li.append(li1[i]) i += 1 else: merged_li.append(li2[j]) j += 1 merged_li = merged_li + merged_li[i:] + merged_li[j:]

How to check if all the elements in a given lists are unique? so [1, 2, 3] is unique but [1, 1, 2, 3] is not unique because 1 exists twice

There are many ways of solving this problem:
# Note: :list and -> bool are just python typings, they are not needed for the correct execution of the algorithm.

Taking advantage of sets and len:

def is_unique(l:list) -> bool:
    return len(set(l)) == len(l)

This one is can be seen used in other programming languages.

def is_unique2(l:list) -> bool:
    seen = []

    for i in l:
        if i in seen:
            return False
        seen.append(i)
    return True

Here we just count and make sure every element is just repeated once.

def is_unique3(l:list) -> bool:
    for i in l:
        if l.count(i) > 1:
            return False
    return True

This one might look more convulated but hey, one liners.

def is_unique4(l:list) -> bool:
    return all(map(lambda x: l.count(x) < 2, l))

You have the following function

def my_func(li = []):
    li.append("hmm")
    print(li)

If we call it 3 times, what would be the result each call?

['hmm'] ['hmm', 'hmm'] ['hmm', 'hmm', 'hmm']

How to iterate over a list?

for item in some_list: print(item)

How to iterate over a list with indexes?

for i, item in enumerate(some_list): print(i)

How to start list iteration from 2nd index?

Using range like this

for i in range(1, len(some_list)): some_list[i]
Another way is using slicing

for i in some_list[1:]:

How to iterate over a list in reverse order?

Method 1

for i in reversed(li): ...
Method 2

n = len(li) - 1 while n > 0: ... n -= 1

Sort a list of lists by the second item of each nested list

li = [[1, 4], [2, 1], [3, 9], [4, 2], [4, 5]] sorted(li, key=lambda l: l[1])
or

li.sort(key=lambda l: l[1)

Combine [1, 2, 3] and ['x', 'y', 'z'] so the result is [(1, 'x'), (2, 'y'), (3, 'z')]

nums = [1, 2, 3] letters = ['x', 'y', 'z'] list(zip(nums, letters))

What is List Comprehension? Is it better than a typical loop? Why? Can you demonstrate how to use it?

You have the following list: [{'name': 'Mario', 'food': ['mushrooms', 'goombas']}, {'name': 'Luigi', 'food': ['mushrooms', 'turtles']}] Extract all type of foods. Final output should be: {'mushrooms', 'goombas', 'turtles'}

brothers_menu = \ [{'name': 'Mario', 'food': ['mushrooms', 'goombas']}, {'name': 'Luigi', 'food': ['mushrooms', 'turtles']}] # "Classic" Way def get_food(brothers_menu) -> set: temp = [] for brother in brothers_menu: for food in brother['food']: temp.append(food) return set(temp) # One liner way (Using list comprehension) set([food for bro in x for food in bro['food']])

Python - Dictionaries

How to create a dictionary?

my_dict = dict(x=1, y=2) OR my_dict = {'x': 1, 'y': 2} OR my_dict = dict([('x', 1), ('y', 2)])

How to remove a key from a dictionary?

del my_dict['some_key'] you can also use my_dict.pop('some_key') which returns the value of the key.

How to sort a dictionary by values?

{k: v for k, v in sorted(x.items(), key=lambda item: item[1])}

How to sort a dictionary by keys?

dict(sorted(some_dictionary.items()))

How to merge two dictionaries?

some_dict1.update(some_dict2)

Convert the string "a.b.c" to the dictionary {'a': {'b': {'c': 1}}}

output = {} string = "a.b.c" path = string.split('.') target = reduce(lambda d, k: d.setdefault(k, {}), path[:-1], output) target[path[-1]] = 1 print(output)

Common Algorithms Implementation

Can you implement "binary search" in Python?

Solution

Python Files

How to write to a file?

with open('file.txt', 'w') as file: file.write("My insightful comment")

How to print the 12th line of a file?

How to reverse a file?

Sum all the integers in a given file

Print a random line of a given file

Print every 3rd line of a given file

Print the number of lines in a given file

Print the number of of words in a given file

Can you write a function which will print all the file in a given directory? including sub-directories

Write a dictionary (variable) to a file

import json with open('file.json', 'w') as f: f.write(json.dumps(dict_var))

Python OS

How to print current working directory?

import os print(os.getcwd())

Given the path /dir1/dir2/file1 print the file name (file1)

import os print(os.path.basename('/dir1/dir2/file1')) # Another way print(os.path.split('/dir1/dir2/file1')[1])

Given the path /dir1/dir2/file1

Print the path without the file name (/dir1/dir2)
Print the name of the directory where the file resides (dir2)

import os ## Part 1. # os.path.dirname gives path removing the end component dirpath = os.path.dirname('/dir1/dir2/file1') print(dirpath) ## Part 2. print(os.path.basename(dirpath))

How do you execute shell commands using Python?

How do you join path components? for example /home and luig will result in /home/luigi

How do you remove non-empty directory?

Python Regex

How do you perform regular expressions related operations in Python? (match patterns, substitute strings, etc.)

Using the re module

How to substitute the string "green" with "blue"?

How to find all the IP addresses in a variable? How to find them in a file?

Python Strings

Find the first repeated character in a string

While you iterate through the characters, store them in a dictionary and check for every character whether it's already in the dictionary.

def firstRepeatedCharacter(str): chars = {} for ch in str: if ch in chars: return ch else: chars[ch] = 0

How to extract the unique characters from a string? for example given the input "itssssssameeeemarioooooo" the output will be "mrtisaoe"

x = "itssssssameeeemarioooooo" y = ''.join(set(x))

Find all the permutations of a given string

def permute_string(string): if len(string) == 1: return [string] permutations = [] for i in range(len(string)): swaps = permute_string(string[:i] + string[(i+1):]) for swap in swaps: permutations.append(string[i] + swap) return permutations print(permute_string("abc"))
Short way (but probably not acceptable in interviews):

from itertools import permutations [''.join(p) for p in permutations("abc")]
Detailed answer can be found here: http://codingshell.com/python-all-string-permutations

How to check if a string contains a sub string?

Find the frequency of each character in string

Count the number of spaces in a string

Given a string, find the N most repeated words

Given the string (which represents a matrix) "1 2 3\n4 5 6\n7 8 9" create rows and colums variables (should contain integers, not strings)

What is the result of each of the following?

>> ', '.join(["One", "Two", "Three"])
>> " ".join("welladsadgadoneadsadga".split("adsadga")[:2])
>> "".join(["c", "t", "o", "a", "o", "q", "l"])[0::2]

>>> 'One, Two, Three' >>> 'well done' >>> 'cool'

If x = "pizza", what would be the result of x[::-1]?

It will reverse the string, so x would be equal to azzip.

Reverse each word in a string (while keeping the order)

What is the output of the following code: "".join(["a", "h", "m", "a", "h", "a", "n", "q", "r", "l", "o", "i", "f", "o", "o"])[2::3]

mario

Python Iterators

What is an iterator?

Python Misc

Explain data serialization and how do you perform it with Python

How do you handle argument parsing in Python?

What is a generator? Why using generators?

What would be the output of the following block?

for i in range(3, 3):
   print(i)

No output :)

What is yeild? When would you use it?

Explain the following types of methods and how to use them:

Static method
Class method
instance method

How to reverse a list?

How to combine list of strings into one string with spaces between the strings

You have the following list of nested lists: [['Mario', 90], ['Geralt', 82], ['Gordon', 88]] How to sort the list by the numbers in the nested lists?

One way is:

the_list.sort(key=lambda x: x[1])

Explain the following:

zip()
map()
filter()

Python - Slicing

For the following slicing exercises, assume you have the following list: my_list = [8, 2, 1, 10, 5, 4, 3, 9]

What is the result of `my_list[0:4]`?

What is the result of `my_list[5:6]`?

What is the result of `my_list[5:5]`?

What is the result of `my_list[::-1]`?

What is the result of `my_list[::3]`?

What is the result of `my_list[2:]`?

What is the result of `my_list[:3]`?

Python Debugging

How do you debug Python code?

pdb :D

How to check how much time it took to execute a certain script or block of code?

What empty return returns?

Short answer is: It returns a None object.

We could go a bit deeper and explain the difference between

def a ():
    return

>>> None

And

def a ():
    pass

>>> None

Or we could be asked this as a following question, since they both give the same result.

We could use the dis module to see what's going on:

  2           0 LOAD_CONST               0 (<code object a at 0x0000029C4D3C2DB0, file "<dis>", line 2>)
              2 LOAD_CONST               1 ('a')
              4 MAKE_FUNCTION            0
              6 STORE_NAME               0 (a)

  5           8 LOAD_CONST               2 (<code object b at 0x0000029C4D3C2ED0, file "<dis>", line 5>)
             10 LOAD_CONST               3 ('b')
             12 MAKE_FUNCTION            0
             14 STORE_NAME               1 (b)
             16 LOAD_CONST               4 (None)
             18 RETURN_VALUE

Disassembly of <code object a at 0x0000029C4D3C2DB0, file "<dis>", line 2>:
  3           0 LOAD_CONST               0 (None)
              2 RETURN_VALUE

Disassembly of <code object b at 0x0000029C4D3C2ED0, file "<dis>", line 5>:
  6           0 LOAD_CONST               0 (None)
              2 RETURN_VALUE

An empty return is exactly the same as return None and functions without any explicit return will always return None regardless of the operations, therefore

def sum(a, b):
    global c
    c = a + b

>>> None

How to improve the following block of code?

li = []
for i in range(1, 10):
    li.append(i)

[i for i in range(1, 10)]

Given the following function

def is_int(num):
    if isinstance(num, int):
        print('Yes')
    else:
        print('No')

What would be the result of is_int(2) and is_int(False)?

Python - Linked List

Can you implement a linked list in Python?

The reason we need to implement in the first place, it's because a linked list isn't part of Python standard library.
To implement a linked list, we have to implement two structures: The linked list itself and a node which is used by the linked list.

Let's start with a node. A node has some value (the data it holds) and a pointer to the next node

class Node(object): def __init__(self, data): self.data = data self.next = None
Now the linked list. An empty linked list has nothing but an empty head.

class LinkedList(object): def __init__(self): self.head = None
Now we can start using the linked list

ll = Linkedlist() ll.head = Node(1) ll.head.next = Node(2) ll.head.next.next = Node(3)
What we have is:

| 1 | -> | 2 | -> | 3 |

Usually, more methods are implemented, like a push_head() method where you insert a node at the beginning of the linked list

def push_head(self, value): new_node = Node(value) new_node.next = self.head self.head = new_node

Add a method to the Linked List class to traverse (print every node's data) the linked list

def print_list(self): node = self.head while(node): print(node.data) node = node.next

Write a method to that will return a boolean based on whether there is a loop in a linked list or not

Let's use the Floyd's Cycle-Finding algorithm:

def loop_exists(self): one_step_p = self.head two_steps_p = self.head while(one_step_p and two_steps_p and two_steps_p.next): one_step_p = self.head.next two_step_p = self.head.next.next if (one_step_p == two_steps_p): return True return False

Python - Stack

Implement Stack in Python

Python Testing

What is your experience with writing tests in Python?

What is PEP8? Give an example of 3 style guidelines

PEP8 is a list of coding conventions and style guidelines for Python

5 style guidelines:

1. Limit all lines to a maximum of 79 characters. 2. Surround top-level function and class definitions with two blank lines. 3. Use commas when making a tuple of one element 4. Use spaces (and not tabs) for indentation 5. Use 4 spaces per indentation level

How to test if an exception was raised?

What assert does in Python?

Explain mocks

How do you measure execution time of small code snippets?

Why one shouldn't use assert in non-test/production code?

Flask

Can you describe what is Django/Flask and how you have used it? Why Flask and not Django? (or vice versa)

What is a route?

As every web framework, Flask provides a route functionality that lets you serve a content of a given URL.
There are multiple ways to map a URL with a function in Python.

Decorator: you can use python decorators. In this case we're using app. This app decorator is the instance of the Flask class. And route it's a method of this class.

@app.route('/') def home(): return 'main website'

add_url_rule method: This is a method of the Flask class. We can also use it for map the URL with a function.

def home(): return 'main website' app.add_url_rule('/', view_func=home)

What is a blueprint in Flask?

What is a template?

zip

Given x = [1, 2, 3], what is the result of list(zip(x))?

[(1,), (2,), (3,)]

What is the result of each of the following:

list(zip(range(5), range(50), range(50)))
list(zip(range(5), range(50), range(-2)))

[(0, 0, 0), (1, 1, 1), (2, 2, 2), (3, 3, 3), (4, 4, 4)] []

Python Descriptors

Explain Descriptors

Read about descriptors here

What would be the result of running a.num2 assuming the following code

class B:
    def __get__(self, obj, objtype=None):
        reuturn 10

class A:
    num1 = 2
    num2 = Five()

What would be the result of running some_car = Car("Red", 4) assuming the following code

class Print:

    def __get__(self, obj, objtype=None):
        value = obj._color
        print("Color was set to {}".format(valie))
        return value

    def __set__(self, obj, value):
        print("The color of the car is {}".format(value))
        obj._color = value

class Car:

    color = Print()

    def __ini__(self, color, age):
        self.color = color
        self.age = age

An instance of Car class will be created and the following will be printed: "The color of the car is Red"

Python Misc

How can you spawn multiple processes with Python?

Implement simple calculator for two numbers

def add(num1, num2): return num1 + num2 def sub(num1, num2): return num1 - num2 def mul(num1, num2): return num1*num2 def div(num1, num2): return num1 / num2 operators = { '+': add, '-': sub, '*': mul, '/': div } if __name__ == '__main__': operator = str(input("Operator: ")) num1 = int(input("1st number: ")) num2 = int(input("2nd number: ")) print(operators[operator](num1, num2))

What data types are you familiar with that are not Python built-in types but still provided by modules which are part of the standard library?

This is a good reference https://docs.python.org/3/library/datatypes.html

Explain what is a decorator

In python, everything is an object, even functions themselves. Therefore you could pass functions as arguments for another function eg;
def wee(word): return word def oh(f): return f + "Ohh" >>> oh(wee("Wee")) <<< Wee Ohh
This allows us to control the before execution of any given function and if we added another function as wrapper, (a function receiving another function that receives a function as parameter) we could also control the after execution.

Sometimes we want to control the before-after execution of many functions and it would get tedious to write

f = function(function_1()) f = function(function_1(function_2(*args)))

every time, that's what decorators do, they introduce syntax to write all of this on the go, using the keyword '@'.

Can you show how to write and use decorators?

These two decorators (ntimes and timer) are usually used to display decorators functionalities, you can find them in lots of tutorials/reviews. I first saw these examples two years ago in pyData 2017. https://www.youtube.com/watch?v=7lmCu8wz8ro&t=3731s
Simple decorator: def deco(f): print(f"Hi I am the {f.__name__}() function!") return f @deco def hello_world(): return "Hi, I'm in!" a = hello_world() print(a) >>> Hi I am the hello_world() function! Hi, I'm in!
This is the simplest decorator version, it basically saves us from writting a = deco(hello_world()). But at this point we can only control the before execution, let's take on the after:

def deco(f): def wrapper(*args, **kwargs): print("Rick Sanchez!") func = f(*args, **kwargs) print("I'm in!") return func return wrapper @deco def f(word): print(word) a = f("************") >>> Rick Sanchez! ************ I'm in!
deco receives a function -> f wrapper receives the arguments -> *args, **kwargs

wrapper returns the function plus the arguments -> f(*args, **kwargs) deco returns wrapper.

As you can see we conveniently do things before and after the execution of a given function.

For example, we could write a decorator that calculates the execution time of a function.

import time def deco(f): def wrapper(*args, **kwargs): before = time.time() func = f(*args, **kwargs) after = time.time() print(after-before) return func return wrapper @deco def f(): time.sleep(2) print("************") a = f() >>> 2.0008859634399414
Or create a decorator that executes a function n times.

def n_times(n): def wrapper(f): def inner(*args, **kwargs): for _ in range(n): func = f(*args, **kwargs) return func return inner return wrapper @n_times(4) def f(): print("************") a = f() >>>************ ************ ************ ************

Write a decorator that calculates the execution time of a function

Write a script which will determine if a given host is accessible on a given port

Are you familiar with Dataclasses? Can you explain what are they used for?

You wrote a class to represent a car. How would you compare two cars instances if two cars are equal if they have the same model and color?

class Car: def __init__(self, model, color): self.model = model self.color = color def __eq__(self, other): if not isinstance(other, Car): return NotImplemented return self.model == other.model and self.color == other.color >> a = Car('model_1', 'red') >> b = Car('model_2', 'green') >> c = Car('model_1', 'red') >> a == b False >> a == c True

Explain Context Manager

Tell me everything you know about concurrency in Python

Explain the Buffer Protocol

Do you have experience with web scraping? Can you describe what have you used and for what?

Can you implement Linux's tail command in Python? Bonus: implement head as well

You have created a web page where a user can upload a document. But the function which reads the uploaded files, runs for a long time, based on the document size and user has to wait for the read operation to complete before he/she can continue using the web site. How can you overcome this?

How yield works exactly?

Monitoring

Explain monitoring. What is it? What its goal?

Google: "Monitoring is one of the primary means by which service owners keep track of a system’s health and availability".

What is wrong with the old approach of watching for a specific value and trigger an email/phone alert while value is exceeded?

This approach require from a human to always check why the value exceeded and how to handle it while today, it is more effective to notify people only when they need to take an actual action. If the issue doesn't require any human intervention, then the problem can be fixed by some processes running in the relevant environment.

What types of monitoring outputs are you familiar with and/or used in the past?

Alerts
Tickets
Logging

What is the difference between infrastructure monitoring and application monitoring? (methods, tools, ...)

Prometheus

What is Prometheus? What are some of Prometheus's main features?

In what scenarios it might be better to NOT use Prometheus?

From Prometheus documentation: "if you need 100% accuracy, such as for per-request billing".

Describe Prometheus architecture and components

Can you compare Prometheus to other solutions like InfluxDB for example?

What is an Alert?

Describe the following Prometheus components:

Prometheus server
Push Gateway
Alert Manager

Prometheus server is responsible for scraping and storing the data
Push gateway is used for short-lived jobs
Alert manager is responsible for alerts ;)

What is an Instance? What is a Job?

What core metrics types Prometheus supports?

What is an exporter? What is it used for?

Which Prometheus best practices are you familiar with?. Name at least three

How to get total requests in a given period of time?

What HA in Prometheus means?

How do you join two metrics?

How to write a query that returns the value of a label?

How do you convert cpu_user_seconds to cpu usage in percentage?

Go

What are some characteristics of the Go programming language?

Strong and static typing - the type of the variables can't be changed over time and they have to be defined at compile time

Simplicity

Fast compile times

Built-in concurrency

Garbage collected

Platform independent

Compile to standalone binary - anything you need to run your app will be compiled into one binary. Very useful for version management in run-time.

Go also has good community.

What is the difference between var x int = 2 and x := 2?

The result is the same, a variable with the value 2.

With var x int = 2 we are setting the variable type to integer while with x := 2 we are letting Go figure out by itself the type.

True or False? In Go we can redeclare variables and once declared we must use it.

False. We can't redeclare variables but yes, we must used declared variables.

What libraries of Go have you used?

This should be answered based on your usage but some examples are:

fmt - formatted I/O

What is the problem with the following block of code? How to fix it?

func main() {
    var x float32 = 13.5
    var y int
    y = x
}

The following block of code tries to convert the integer 101 to a string but instead we get "e". Why is that? How to fix it?

package main

import "fmt"

func main() {
    var x int = 101
    var y string
    y = string(x)
    fmt.Println(y)
}

It looks what unicode value is set at 101 and uses it for converting the integer to a string. If you want to get "101" you should use the package "strconv" and replace y = string(x) with y = strconv.Itoa(x)

What is wrong with the following code?:

package main

func main() {
    var x = 2
    var y = 3
    const someConst = x + y
}

Constants in Go can only be declared using constant expressions. But x, y and their sum is variable.
const initializer x + y is not a constant