Round 3 started off with the relatively accessible Googlements problem, in which a bit of combinatorics knowledge saved some time. Good News and Bad News asked contestants to assign weights to directed edges in a graph to achieve a particular goal; it could be solved via a construction or a sufficiently well-designed randomized algorithm. Mountain Tour was a complex-looking graph problem that boiled down to one interesting key insight, and Slate Modern was a challenging implementation exercise to round out the set.
It was an exciting contest! The lead changed many times in the first hour and a half. Gennady.Korotkevich was the first to hit 74 points, with vepifanov following a mere 8 seconds later! kevinsogo was the first of only four contestants to solve the tough Large of Slate Modern, over an hour and a half into the contest, and that plus other solves propelled him to a victory; he was the only contestant to reach 76 points. The Large of Mountain Tour was more widely solved, and the cutoff for the top 25 turned out to be 74 points and a finishing time under 2:30.
We'll see our finalists in August in Dublin, where they will compete for a $15,000 prize and the title of Code Jam World Champion! Will Gennady.Korotkevich successfully defend his title to become a consecutive four-time champion? Or will kevinsogo, who took second place in last year's Finals, repeat the first-place performance that we just saw in Round 3? Or will we see something completely different? Find out next time!
Cast
Problem A (Googlements): Written and prepared by Ian Tullis.
Problem B (Good News and Bad News): Written by Petr Mitrichev. Prepared by Ian Tullis.
Problem C (Mountain Tour): Written by David Arthur. Prepared by Shane Carr.
Problem D (Slate Modern): Written by Pablo Heiber. Prepared by Petr Mitrichev.
Solutions and other problem preparation and review by Liang Bai, John Dethridge, Jackson Gatenby, Md Mahbubul Hasan, Zhusong Li, Petr Mitrichev, Trung Thanh Nguyen, and Josef Ziegler.
Analysis authors:
Chemists work with periodic table elements, but here at Code Jam, we have
been using our advanced number smasher to study googlements. A
googlement is a substance that can be represented by a string of at most
nine digits. A googlement of length L must contain only decimal digits in the
range 0 through L, inclusive, and it must contain at least one digit greater
than 0. Leading zeroes are allowed. For example, 103
and
001
are valid googlements of length 3. 400
(which contains a digit, 4, greater than the length of the googlement, 3) and
000
(which contains no digit greater than 0) are not.
Any valid googlement can appear in the world at any time, but it will
eventually decay into another googlement in a deterministic way, as follows.
For a googlement of length L, count the number of 1
s in the
googlement (which could be 0) and write down that value, then count the
number of 2
s in the googlement (which could be 0) and write down
that value to the right of the previous value, and so on, until you finally
count and write down the number of Ls. The new string generated in this way
represents the new googlement, and it will also have length L. It is even
possible for a googlement to decay into itself!
For example, suppose that the googlement 0414
has just appeared.
This has one 1
, zero 2
s, zero 3
s, and
two 4
s, so it will decay into the googlement 1002
.
This has one 1
, one 2
, zero 3
s, and
zero 4
s, so it will decay into 1100
, which will
decay into 2000
, which will decay into 0100
, which
will decay into 1000
, which will continuously decay into itself.
You have just observed a googlement G. This googlement might have just appeared in the world, or it might be the result of one or more decay steps. What is the total number of possible googlements it could have been when it originally appeared in the world?
The first line of the input gives the number of test cases, T. T test cases follow. Each consists of one line with a string G, representing a googlement.
For each test case, output one line containing Case #x: y
,
where x
is the test case number (starting from 1) and
y
is the number of different googlements that the observed
googlement could have been when it first appeared in the world.
Memory limit: 1 GB.
1 ≤ T ≤ 100.
Each digit in G is a decimal digit between 0 and the length of
G, inclusive.
G contains at least one non-zero digit.
Time limit: 20 seconds.
1 ≤ the length of G ≤ 5.
Time limit: 60 seconds.
1 ≤ the length of G ≤ 9.
3 20 1 123
Case #1: 4 Case #2: 1 Case #3: 1
In sample case #1, the googlement could have originally been
20
, or it could have decayed from 11
, which could
have itself decayed from 12
or 21
. Neither of the
latter two could have been a product of decay. So there are four
possibilities in total.
In sample case #2, the googlement must have originally been 1
,
which is the only possible googlement of length 1.
In sample case #3, the googlement must have been 123
; no other
googlement could have decayed into it.
You would like to get your F friends to share some news. You know your friends well, so you know which of your friends can talk to which of your other friends. There are P such one-way relationships, each of which is an ordered pair (Ai, Bi) that means that friend Ai can talk to friend Bi. It does not imply that friend Bi can talk to friend Ai; however, another of the ordered pairs might make that true.
For every such existing ordered pair (Ai, Bi), you want friend Ai to deliver some news to friend Bi. In each case, this news will be represented by an integer value; the magnitude of the news is given by the absolute value, and the type of news (good or bad) is given by the sign. The integer cannot be 0 (or else there would be no news!), and its absolute value cannot be larger than F2 (or else the news would be just too exciting!). These integer values may be different for different ordered pairs.
Because you are considerate of your friends' feelings, for each friend, the sum of the values of all news given by that friend must equal the sum of values of all news given to that friend. If no news is given by a friend, that sum is considered to be 0; if no news is given to a friend, that sum is considered to be 0.
Can you find a set of news values for your friends to communicate such that these rules are obeyed, or determine that it is impossible?
The first line of the input gives the number of test cases, T. T test cases follow. Each begins with one line with two integers F and P: the number of friends, and the number of different ordered pairs of friends. Then, P more lines follow; the i-th of these lines has two different integers Ai and Bi representing that friend Ai can talk to friend Bi. Friends are numbered from 1 to F.
For each test case, output one line containing Case #x: y
, where
x
is the test case number (starting from 1) and y
is
either IMPOSSIBLE
if there is no arrangement satisfying the rules
above, or, if there is such an arrangement, P integers, each of which
is nonzero and lies inside [-F2, F2]. The
i-th of those integers corresponds to the i-th ordered pair from the input,
and represents the news value that the first friend in the ordered pair will
communicate to the second. The full set of values must satisfy the conditions
in the problem statement.
If there are multiple possible answers, you may output any of them.
Memory limit: 1 GB.
1 ≤ T ≤ 100.
1 ≤ Ai ≤ F, for all i.
1 ≤ Bi ≤ F, for all i.
Ai ≠ Bi, for all i. (A friend does not
self-communicate.)
(Ai, Bi) ≠ (Aj,
Bj), for all i ≠ j. (No pair of friends is repeated
within a test case in the same order.)
Time limit: 20 seconds.
2 ≤ F ≤ 4.
1 ≤ P ≤ 12.
Time limit: 40 seconds.
2 ≤ F ≤ 1000.
1 ≤ P ≤ 2000.
5 2 2 1 2 2 1 2 1 1 2 4 3 1 2 2 3 3 1 3 4 1 2 2 3 3 1 2 1 3 3 1 3 2 3 1 2
Case #1: 1 1 Case #2: IMPOSSIBLE Case #3: -1 -1 -1 Case #4: 4 -4 -4 8 Case #5: -1 1 1
The sample output shows one possible set of valid answers. Other valid answers are possible.
In Sample Case #1, one acceptable arrangement is to have friend 1 deliver news with value 1 to friend 2, and vice versa.
In Sample Case #2, whatever value of news friend 1 gives to friend 2, it
must be nonzero. So, the sum of news values given to friend 2 is not equal
to zero. However, friend 2 cannot give any news and so that value is 0.
Therefore, the sums of given and received news for friend 2 cannot match, and
the case is IMPOSSIBLE
.
In Sample Case #3, each of friends 1, 2, and 3 can deliver news with value -1 to the one other friend they can talk to — an unfortunate circle of bad news! Note that there is a friend 4 who does not give or receive any news; this still obeys the rules.
In Sample Case #4, note that -5 5 5 -10
would not have been an
acceptable answer, because there are 3 friends, and |-10| > 32.
In Sample Case #5, note that the case cannot be solved without using at least one negative value.
You are on top of Mount Everest, and you want to enjoy all the nice hiking trails that are up there. However, you know from past experience that climbing around on Mount Everest alone is bad — you might get lost in the dark! So you want to go on hikes at pre-arranged times with tour guides.
There are C camps on the mountain (numbered 1 through C), and there are 2 × C one-way hiking tours (numbered 1 through 2 × C). Each hiking tour starts at one camp and finishes at a different camp, and passes through no other camps in between. Mount Everest is sparsely populated, and business is slow; there are exactly 2 hiking tours departing from each camp, and exactly 2 hiking tours arriving at each camp.
Each hiking tour runs daily. Tours 1 and 2 start at camp 1, tours 3 and 4 start at camp 2, and so on: in general, tour 2 × i - 1 and tour 2 × i start at camp i. The i-th hiking tour ends at camp number Ei, leaves at hour Li, and has a duration of exactly Di hours.
It is currently hour 0; the hours in a day are numbered 0 through 23. You are at camp number 1, and you want to do each of the hiking tours exactly once and end up back at camp number 1. You cannot travel between camps except via hiking tours. While you are in a camp, you may wait for any number of hours (including zero) before going on a hiking tour, but you can only start a hiking tour at the instant that it departs.
After looking at the tour schedules, you have determined that it is definitely possible to achieve your goal, but you want to do it as fast as possible. If you plan your route optimally, how many hours will it take you to finish all of the tours?
The first line of the input gives the number of test cases, T. T test cases follow. Each begins with one line with an integer C: the number of camps. Then, 2 × C more lines follow. The i-th of these lines (counting starting from 1) represents one hiking tour starting at camp number floor((i + 1) / 2), and contains three integers Ei, Li, and Di, as described above. Note that this format guarantees that exactly two tours start at each camp.
For each test case, output one line containing Case #x: y
, where
x
is the test case number (starting from 1) and y
is the minimum number of hours that it will take you to achieve your goal,
as described above.
Time limit: 20 seconds per test set.
Memory limit: 1 GB.
1 ≤ T ≤ 100.
1 ≤ Ei ≤ C.
Ei ≠ ceiling(i / 2), for all i. (No hiking tour
starts and ends at the same camp.)
size of {j : Ej = i} = 2, for all j. (Exactly two tours
end at each camp.)
0 ≤ Li ≤ 23.
1 ≤ Di ≤ 1000.
There is at least one route that starts and ends at camp 1 and includes each
hiking tour exactly once.
2 ≤ C ≤ 15.
2 ≤ C ≤ 1000.
2 2 2 1 5 2 0 3 1 4 4 1 6 3 4 3 0 24 2 0 24 4 0 24 4 0 24 2 0 24 1 0 24 3 0 24 1 0 24
Case #1: 32 Case #2: 192
In sample case #1, the optimal plan is as follows:
This achieves the goal in 1 day and 8 hours, or 32 hours. Any other plan takes longer.
In sample case #2, all of the tours leave at the same time and are the same duration. After finishing any tour, you can immediately take another tour. If we number the tours from 1 to 8 in the order in which they appear in the test case, one optimal plan is: 1, 5, 4, 7, 6, 2, 3, 8.
The prestigious Slate Modern gallery specializes in the latest art craze: grayscale paintings that follow very strict rules. Any painting in the gallery must be a grid with R rows and C columns. Each cell in the grid is painted with a color of a certain positive integer brightness value; to make sure the art is not too visually startling, the brightness values of any two cells that share an edge (not just a corner) must differ by no more than D units.
Your artist friend Cody-Jamal is working on a canvas for the gallery. Last night, he became inspired and filled in N different particular cells with certain positive integer brightness values. You just told him about the gallery's rules today, and now he wants to know whether it is possible to fill in all of the remaining cells with positive integer brightness values and complete the painting without breaking the gallery's rules. If this is possible, he wants to make the sum of the brightness values as large as possible, to save his black paint. Can you help him find this sum or determine that the task is impossible? Since the output can be a really big number, we only ask you to output the remainder of dividing the result by the prime 109+7 (1000000007).
The first line of the input gives the number of test cases, T. T test cases follow. Each test case begins with one line with four integers: R, C, N, and D, as described above. Then, N lines follow; the i-th of these has three integers Ri, Ci, and Bi, indicating that the cell in the Rith row and Cith column of the grid has brightness value Bi. The rows and columns of the grid are numbered starting from 1.
For each test case, output one line containing Case #x: y
, where
x
is the test case number (starting from 1) and y
is either IMPOSSIBLE
if it is impossible to complete the
picture, or else the value of the maximum possible sum of all brightness
values modulo the prime 109+7 (1000000007).
Memory limit: 1 GB.
1 ≤ T ≤ 100.
1 ≤ N ≤ 200.
1 ≤ D ≤ 109.
1 ≤ Ri ≤ R, for all i.
1 ≤ Ci ≤ C, for all i.
1 ≤ Bi ≤ 109, for all i.
(Note that the upper bound only applies to cells that Cody-Jamal already
painted. You can assign brightness values larger than 109 to other
cells.)
N < R × C. (There is at least one empty
cell.)
Ri ≠ Rj and/or
Ci ≠ Cj for all i ≠ j.
(All of the given cells are different cells in the grid.)
Time limit: 40 seconds.
1 ≤ R ≤ 200.
1 ≤ C ≤ 200.
Time limit: 80 seconds.
1 ≤ R ≤ 109.
1 ≤ C ≤ 109.
4 2 3 2 2 2 1 4 1 2 7 1 2 1 1000000000 1 2 1000000000 3 1 2 100 1 1 1 3 1 202 2 2 2 2 2 1 1 2 2 4
Case #1: 40 Case #2: 999999986 Case #3: IMPOSSIBLE Case #4: IMPOSSIBLE
In Sample Case #1, the optimal way to finish the painting is:
6 7 9
4 6 8
and the sum is 40.
In Sample Case #2, the optimal way to finish the painting is:
2000000000 1000000000
and the sum is 3000000000; modulo 109+7, it is 999999986.
In Sample Case #3, the task is impossible. No matter what value you choose for the cell in row 2, it will be too different from at least one of the two neighboring filled-in cells.
In Sample Case #4, the two cells that Cody-Jamal filled in already have brightness values that are too far apart, so it is impossible to continue.
In the Small dataset, the googlements can have a maximum length of 5. We will discuss googlements of that length; the procedure for shorter ones is the same.
Any string of length 5 that uses digits only in the range [0, 5] and is not
00000
is a googlement; there are 65 - 1 = 7775 of
these. One workable solution is to iterate through all of these possible
googlements and simulate the decay process for each one until it reaches the
sole possible looping end state of 10000
. As we do this, for
each googlement, we maintain a count of how many decay chains it appears in.
When we are finished, we will know how many possible ancestors each
googlement has, and so we will have the answer to every possible Small test
case!
Later in this analysis, we will show why 10000
is the only
googlement of length 5 that is its own descendant, and we will look into how
long a decay chain can go on. Even if each chain were somehow as long as the
number of googlements, though, with only 7775 googlements, this method would
still easily run fast enough to solve the Small.
That approach is "top-down"; a "bottom-up" approach to this tree problem also
works. For a given googlement, we can figure out what digits must have been
in the googlement that created it (its "direct ancestor"). For example, if we
are given 12000
, we know that any direct ancestor must have one
1
, two 2
s, and two 0
s to bring the
total number of digits to five. Then we can create all such googlements
(e.g., 12020
) and recursively count their direct
ancestors (e.g., 42412
) in the same way. This process does not
go on forever, because (as we noted above) there are at most 7775 googlements
in the tree. The worst-case scenario is 10000
, which is a
descendant of every googlement.
After thinking through the bottom-up approach, only implementation details remain. The toughest part is generating direct ancestors with particular counts of particular digits. A permutation library can help, or we can write code to generate permutations in a way that avoids repeatedly considering duplicates. We can save some time across test cases by memoizing results for particular nodes of the tree, since the decay behavior of a given googlement is always exactly the same, and some of the same googlements could (and probably will) show up many times in different cases.
In the bottom-up Small approach outlined above, we spent a lot of time
generating new direct ancestors to check. For ancestors such as
12020
that themselves had direct ancestors, this was necessary.
However, it was a waste of time to construct and enumerate ancestors
such as 42412
, which have no ancestors of their own. More
generally, a googlement of length L with a digit sum of more than L cannot
have any ancestors, since it is impossible to fit more than L digits into a
length of L.
It turns out that avoiding enumerating these ancestor-less ancestors saves enough time to turn that bottom-up Small solution into a Large solution. We can do that with some help from combinatorics and the multinomial theorem.
Let's think again about the direct ancestors of 12020
. Each must
have one 1
, two 2
s, and two 4
s. How
many ways are there to construct such a string? Starting with a blank set of
5 digits, there are (5 choose 2) = 10 ways to place the 2
s, then
(3 choose 2) = 3 ways to place the 4
s into two of the three
remaining slots, then (1 choose 1) = 1 ways to place the leftover
1
. These terms multiply to give us a total of 30 ways, so
12020
has 30 direct ancestors. Since each has a digit sum of 13,
which is greater than 5, none of them have their own ancestors. So we do not
care what they are; we can just add 30 to our total.
At this point, we can either test this improvement on the worst-case
googlement 100000000
before downloading the Large dataset, or
we can reassure ourselves via more combinatorics. The googlements of length 9
with ancestors are the ones with a digit sum less than or equal to 9; we can
use a
balls-in-boxes argument
to find that that number is (10 + 9 - 1)! / (10! * (9-1)!), which is 92377.
This is a tiny fraction of the 999999999 possible googlements of length 9; we
have avoided enumerating the other 999900000 or so! As long as our method for
generating ancestors doesn't impose too much overhead, this is easily fast
enough to solve 100 Large test cases in time, even if most or all of them
explore most or all of the tree.
Let's prove that there is only one looping state for a given googlement length, and that there are no other loops in the graph. We will start with some observations.
0
digits in
the googlement equals the sum of the digits in the googlement it decays
into.0
s or 1
s will decay into a googlement with a
smaller digit sum.0
s and 1
s
will decay into a googlement with the same digit sum. If the googlement
is 1
followed by zero or more 0
s, it will decay
into itself. If the googlement has a single 1
at another position,
it will decay into a 1
followed by 0
s. Otherwise, it
must have at least two 1
s, so it will decay into a googlement with
the same digit sum but with at least one digit other than 0
or
1
.We can draw some useful conclusions from the above observations:
1
followed by L-1 0
s. For any other googlement,
decay would either reduce its digit sum or create a new googlement with
a digit other than 0
or 1
, which would itself
decay into a googlement with a smaller digit sum.999999999
will immediately decay into 000000009
, losing a large amount
of its digit sum.The ordered pairs of friends form a directed graph in which each friend is a node and each news path is an edge. Notice that we can work on one connected component at a time, because each can be solved independently of the others. In the following, we assume a connected graph.
One additional observation is that if any edge of the undirected graph is a bridge — that is, an edge not present in any cycle — then the task is impossible. This is a consequence of the news balance on nodes extending to subsets of nodes: if we group nodes on each side of the bridge, the aggregate balance of each subset must be zero, and values in edges internal to the subset don't affect that balance. Since the bridge is the only external edge, this forces its value to be zero. Formally, the news balance property is preserved by vertex identification, and we can identify all vertices on each side of the bridge to reach a contradiction.
Note that since we can pass positive or negative news on each edge, we can forget about the directions of edges in our graph, and consider it as undirected. If the directed version had opposing edges (v, w) and (w, v), the undirected version can just have two edges between v and w. After solving the problem on the undirected graph, we simply adjust the sign of the value we assign to each edge according to its original direction.
The Small dataset can have up to 4 friends, so there can be as many as 12 edges. The bad news is that with up to 2 × 42 = 32 possible legal values for each of those edges, there can be 3212 possible answers to check, which is far too large to handle with brute force. The good news is that there are only on the order of 212 graphs to consider; we can rule out the ones that have bridges (since those are impossible), and then run whatever method we come up with on all of the others to see if we get solutions.
What if we don't really need the full extent of the [-16, 16] range? We can try restricting our values to a smaller set and see whether we get solutions. As Sample Case #5 demonstrates, we might need negative values, so we can try a set like {-2, -1, 1, 2, 3}, for instance. It turns out that this set alone is enough to solve all possible Small cases that do not contain a bridge, so we can use brute force to find all possible assignments.
Judging by the Small solution, we may believe that all bridgeless graphs do have a solution, and that's indeed the case (as we will see in Solution 2, below).
Solution 1. Start by using depth-first search to build a spanning tree of the underlying undirected graph. Assign all non-tree edges a random nonzero value in [-K, K]. Now, the current balance of each node is some integer. In leaf-to-root order, we can go through all nodes using the single non-assigned edge remaining to make the balance zero. Notice that if we fix all F-1 non-root nodes, the root is automatically fixed, because the sum of the balance over all nodes always equals 0 (an edge with value v contributes v to one balance and -v to another). Notice that it is possible that a node's balance is 0 when we need to fix it, leaving us no non-zero choice for the corresponding tree edge. Additionally, if the balance on a node is too high, the value we need for the corresponding edge could be out of range. However, a value for K around F yields a small probability of both, which we can find through experimentation. And, if we happen to run into a problem, we can re-run the algorithm. As long as the probability of success is a not-too-small constant, running it enough times will eventually yield a valid assignment. Since we can implement this in linear time, and the bounds are small, we can afford thousands of runs per test case, or possibly more. So, even with weak experimental data on the probabilities, we can build enough confidence that this method works. Notice that we can avoid explicitly checking for bridges by assuming that failure after 1000 or so retries means the case is impossible.
Solution 2. Again, find a DFS spanning tree of the graph and number the nodes in discovery order. Remember that in a DFS over an undirected graph, every non-tree edge goes from a node to some ancestor in the tree. Direct all edges in root-to-leaf direction (we reverse or split edges after solving, as explained above). We assign edges not in the tree a value of 1, that is, they send positive news from nodes to descendants. On the other hand, we assign negative values to tree edges. We process the nodes in reverse discovery order: as in the previous randomized solution, each node has a single adjacent edge with no assigned value, so we assign it the only value that makes the node balanced.
Let us prove that the values assigned this way are all negative if there is no bridge or contain a zero if there is a bridge: assume a value of 0 for unassigned edges to check the current balance of a node x right before we attempt to balance it. Let A be the set of descendants of x, including x itself. There is exactly one tree edge going into A: the edge that we are trying to assign at this step. Let us call it e. All other edges going into A are non-tree, so they all have a value of 1. The current aggregate balance of A then equals the number of non-tree edges going into A. Since nodes are processed in leaf-to-root order, all nodes in A other than x are balanced, and A's balance equals x's balance. If the number is zero, then there are no edges other than e going into A, which makes e a bridge. If the number is greater than 0 (it is a number of edges, so it can't be negative), the current balance of x is greater than 0 and we can assign its opposite, a negative number, to e. This value is always within the legal [-F2, F2] range, because its magnitude is equal to some number of edges in the graph, and the total number of edges in the graph cannot exceed F × (F - 1), which is less than F2.
Solution 3. We keep the nodes balanced at all times. Start with 0 on all edges. For each undirected edge, if it has a 0 in the current valuation, find a cycle that contains it and then add -K to all edges of the cycle, where K is an integer different from the current values of all edges in the cycle. This choice of K guarantees that the procedure doesn't change a non-zero edge into zero, and it always changes at least one zero edge into a non-zero (the current edge), so it always finishes. Adding a value to edges in a cycle maintains the balance of each node. If we choose K with minimum possible absolute value, and cycles as short as possible, this won't exceed the range. The easy to prove maximum absolute value this can produce is F × P / 2, because K is always in the range [-F/2, F/2] as there can't be more than F forbidden values for it. There are also at most P steps. However, there are many results showing that graphs with many edges have lots of short cycles, so we believe it's impossible to construct a case in which the range is exceeded. Further, if we randomize the order to fix edges and retry many times, we can work around the worst cases that specifically try to overload a given edge, but possibly produce lots of small cycles that help fix the others cheaply. Notice that the idea of using the DFS tree to fix balances in the previous solutions is actually the same idea from this theoretically simpler solution, but using the fundamental cycles of the graph instead of potentially all of them.
Since each hiking tour can be taken only once, each of the hiking tours arriving at a camp must be followed by a different departing tour. Since each camp has two arriving and two departing tours, there are exactly two ways to “pair” the hiking tours at a camp: the first arrival with the first departure (and the second arrival with the second departure), or the first arrival with the second departure (and the second arrival with the first departure). We can think of a candidate solution as specifying this boolean decision for each camp. The total duration is the sum of each hiking trip's duration and the amount of time waiting at each camp.
When evaluating a possible solution, you must check whether or not every hiking tour is present on the path beginning with your start hike. It is possible for a path through the graph to represent a cycle with fewer than 2C edges. For example, the left side of the figure farther down on the page shows a setup with three disjoint paths through the graph.
Because the time spent waiting at the base camp is calculated differently than the time spent waiting at all other camps, the base camp needs to be handled as a special case. You could simply run the algorithm four times, once for each “start” and “end” hike.
The entire space can be explored in O(2C) time, which is sufficient for the Small dataset with C ≤ 15.
Another way to look at the small dataset is as a special case of the Travelling Salesman Problem (TSP). Consider the directed graph in which each hiking tour is a node and edges are the possible pairings between hiking tours, four for each camp, with edge weights corresponding to the amount of time you would have to wait at the camp in order to make that transfer. For example, if hiking tour A arrived at node 1 at time 02:00, and hiking tour B left node 1 at time 06:00, there would be an edge from A to B with weight 4. Running TSP on this graph, again with special handling for the base camp, can also yield a correct solution.
Let's look more closely at the two possible arrival-departure pairings for each camp. Suppose a camp has tours arriving at 13:00 and 21:00 and tours leaving at 22:00 and 07:00, such that there are no departures between the times that the two tours arrive. If the 13:00 arrival were paired with the 22:00 departure and the 21:00 arrival were paired with the 07:00 departure, the total wait time at the camp would be (22 – 13) + ((7 – 21) mod 24) = 19 hours. If we did the other pairing, the total wait time would be (22 – 21) + ((7 – 13) mod 24) = 19 hours. Both pairings result in the same total wait time at the camp. We will call these camps the “free” camps.
Now consider a camp having tours arriving at 11:00 and 23:00 and tours leaving at 17:00 and 08:00, such that after each tour arrives, there is a tour that leaves before the other tour arrives. If the 11:00 arrival were paired with the 17:00 departure and the 23:00 arrival were paired with the 08:00 departure, the total wait time at the camp would be (17 – 11) + ((8 – 23) mod 24) = 15 hours. If we did the other pairing, the total wait time would be ((17 – 23) mod 24) + ((8 – 11) mod 24) = 39 hours, 24 hours longer than the other pairing.
We mentioned that for the small solution, you must ensure that every hiking tour is present on the path starting from the base camp. What does the graph look like if there are hiking tours that are not present on that path? It is a set of disjoint cycles. Each camp has two paths crossing through it. If those paths are on different cycles, switching the pairing of that camp will in effect merge the two cycles together.
For example, consider the following figure with 5 camps. Camps 2 and 5 are free, and camps 3 and 4 are 24-hour switches. The initial graph has 3 disjoint cycles. As it turns out, to merge all of the cycles together, it is necessary to switch the pairing of camp 5 for free and either camp 3 or camp 4 for a 24-hour penalty. Note that the red path, the one starting and ending at the base camp, does not pass through camp 5 prior to switching the pairings, but switching the pairing of camp 5 is nonetheless required.
In this initial state, wait times are minimized, but there are several disjoint cycles.
Now there is only one cycle.
Thus, to solve the problem efficiently, add each hiking tour to a Disjoint Sets data structure. Iterate through each camp. If the camp is free, union all four hiking tours connected to that camp into the same set (this is analogous to letting the free camps take either pairing); otherwise, union each arrival with only its respective departure. You will end up with one disjoint set per cycle in the graph. If there are Q > 1 disjoint sets, since all of the free camps have already been accounted for, you will need to switch the pairing of Q – 1 camps with 24-hour penalties. These Q – 1 camps are guaranteed to exist. The base camp still needs special handling; you can afford to run the algorithm once for each of the four possible start and end hikes. The total duration is the sum of all the hiking tour durations plus the lowest waiting time at each camp plus 24 hours for each of the Q – 1 penalties. This solution takes O(C α(C)) time, where α(C), the inverse Ackermann function, is the amortized time per operation on the Disjoint Sets data structure.
To formalize the reasoning about this problem and yield both an algorithm and a proof of its correctness we will resort to some graph theory. Consider an undirected graph G where the set of R × C + 1 nodes is the set of cells in the painting plus an additional special node we call root. We add an edge between each pair of adjacent cells with length D and for each cell that has a fixed brightness value v we add an edge from the root to the cell node with length v.
Let p(c) be the length of the shortest path on G from the root to the node that represents cell c.
Property 1. The p(c) is an upper bound on the value that can be assigned to c. Let
root, a1, a2, ..., ak=c
be the path of k edges
from the root to c of total length p(c). By construction of G, a1 is a fixed cell.
Let v be a1's value. Also by construction, the edge (root, a1) has length v
and all other edges (ai, ai+1) have length D.
Thus, p(ai) = v + (i - 1) × D. Clearly p(a1) = v is an
upper bound on its value v, and by induction, if p(ai) is an upper bound on the
values that can be assigned to ai, p(ai+1) = p(ai) + D is
an upper bound on the values that can be assigned to ai+1 because it can't differ by more
than D with the value assigned to ai because the corresponding cells are adjacent
(any edge from G not involving the root connects nodes representing adjacent cells).
Property 2. Let c be a fixed cell with value v. If p(c) ≠ v, then the case is impossible. Since there is an edge (root, c) with length v, p(c) ≤ v. And, if p(c) < v, the case is impossible by Property 1.
Property 3. If p(c) is exactly the value assigned to c for all fixed cells c, then the case is possible and assigning p(c) to each non-fixed cell c is a valid assignment of maximum sum. By the precondition we know that p assigns all fixed-cells their original value, so we only need to check if neighboring cells are assigned values that differ by no more than D. Let c and d be two neighboring cells. Since G contains an edge (c, d) with length D it follows by definition of shortest path that p(c) ≤ p(d) + D and p(d) ≤ p(c) + D. Since p is a valid assignment, and by property 1, it assigns all cells a maximum value, it follows immediately that p is a valid assignment of maximum sum.
This yields an algorithm to solve the Small dataset: calculate the shortest path from the root to all other cells using Dijkstra's algorithm and then use Property 2 to check for impossible cases. If the case is possible, the answer is just the sum of p(c) over all cell nodes c. Dijkstra's algorithm takes O(R × C × log (R × C)) time while both checking for Property 2 and summing take O(R × C) time. Therefore, the overall algorithm takes O(R × C × log R × C) time, that fits comfortably within the Small limits.
Another similar approach that sidesteps the graph theoretical considerations is noticing that by transitivity, a cell at S orthogonal steps of a fixed-cell with value v cannot be assigned a value greater than v + S × D. That means that if two fixed cells are at S orthogonal steps and their value differs by more than S × D the case is impossible. Otherwise, it can be shown that it's possible and a valid assignment of maximum sum results from assigning each cell the minimum of v + S × D over all values for v and S for each fixed cell (which is, of course, the exact same assignment as p() above). Checking all pairs of fixed-cells takes O(N2) time and finding the assignment takes O(R × C × N) time, which means this yields an algorithm that takes O(R × C × N) time overall and it also passes the Small. The claims above can be proven directly, but the graph notation makes it easier, and the claims in both cases are essentially the same. Additionally, the graph theoretical approach directly yields a more efficient Small-only solution.
The size of the grid in the Large dataset is too big to inspect the cells one by one, so we need a different approach. The foundations built while solving the Small dataset, however, are still tremendously useful. We define the same G and function p as before.
Property 4. For every cell c there is a shortest path in G from the root to c of length k:
root, a1, a2, ..., ak=c
where a1 is a fixed cell and there is some i
such that each edge (aj, aj+1) come from a horiztonal adjacency if and only
if j < i. In plain English, there is path that goes from the root to a fixed cell a1,
then moves horizontally zero or more times, and then moves vertically zero or more times.
We can prove this easily by noticing that if the last k-1 steps include h horizontal and k-1-h
vertical steps of any shortest path, a path that does h horiztonal steps first
and k-1-h vertical steps last will reach the same destination (through different intermediate
cells), and since the length of the edges of all horizontal and vertical is the same (D),
the resulting path is also a shortest path.
Let us call our original R × C matrix M. We can use a technique called coordinate compression to consider only the interesting part. We call a row or column interesting if it is in the border (top and bottom rows and leftmost and rightmost columns) or if it contains at least one fixed cell. The interesting submatrix M' is the submatrix of M that results on deleting all non-interesting rows and columns. Notice that M' contains all fixed cells of M, and possibly some non-fixed cells as well. The size of M' is however bounded by (N+2)2, which is much smaller than R × C in the largest cases within Large limits.
Define the graph G and shortest path function p for M in the same way as for the Small. Define also a smaller graph G' whose nodes are a root and cells that are in M'. G' contains one edge (root, c) with length v for each fixed-cell c with value v. Notice that, since M' contains all fixed cells, the root and all its outgoing edges in G' are the same as in G. G' also contains an edge connecting two cells that are orthogonal neighbors in M'. The length of each such edge (c, d) is S × D where S is the distance in orthogonal steps between c and d in the original matrix M.
In the depicted input matrix there are 3 fixed cells. The 3 interesting rows and 4 interesting columns are highlighted in light red, and the cells in the intersection of an interesting row and an interesting column are highlighted in dark red. Those dark red cells make up M' and the nodes of G' (besides the root). The fixed cell containing a 3 has two neighbors in G'. The edge going to its vertical neighbor has length 2 × D, because it is 2 steps away in the original M. Similary, the edge going to its horizontal neighbor has length 3 × D.
Having G', we can define p'(c) as the shortest path in G' from the root to each cell c that exists in M'.
Property 5. For each cell c that exists in M', p'(c) = p(c). To prove this, consider a shortest
path
root, a1, a2, ..., ak=c
in G from the root to c with the hypothesis of Property 4 (that is, it does horizontal steps
first, and vertical steps last). First, notice that if the edge going into ai is
horizontal and the edge going out of it is vertical (that is, ai is the only corner),
then ai is in M', because it shares a row with a1, which is a fixed cell and
thus it is in M', and a column with ak=c, which is also in M'. Between a1 and
ai all moves are horizontal, so we can "skip" the ajs not in M', and the
length of the edges in G' will exactly match the sum of the lengths of all intermediate edges.
The analogous argument works for all the vertical moves between ai and ak.
Notice that Property 5 implies that we can use a similar algorithm over G' to distinguish the impossible cases, as we can calculate p' and then know the value of p'(c) = p(c) for all fixed cells c, which we can use to check Property 2. We still need, however, a way to know the sum over all p(c) for the possible cases, which we can't calculate explicitly.
Let a cell c in M be at row i and column j. Let i0 be the largest interesting row that is no greater than i, and i1 be the smallest interesting row that is no less than i. Notice that i0 = i = i1 if i is interesting and i0 < i < i1 otherwise. Similarly let j0 and j1 be the closest interesting columns to j in each direction. We call the up to 4 cells at positions (i0, j0), (i0, j1), (i1, j0) and (i1, j1), which are all in M', the sourrounding cells of c.
Property 6. For each non-fixed cell c in row i and column j of M there is a shortest path from the root to c in G that goes through one of its sourrounding cells. This can be proven similarly to Property 4. After the first step of going from the root to the appropriate fixed cell a1, any path that has the minimum number of horizontal and vertical steps yields the same total length. Notice that there is always a sourrounding cell that is closer to a1 than c (that's why they are "sourrounding"). Therefore, we can always take a path that goes through that sourrounding cell.
Given Property 6, we can build G', calculate p', and then solve each contiguous submatrix of M delimited by interesting rows as a separate problem. Each of those problems is an instance of our original problem in which we have fixed exactly the four corners. There is overlap in the border among these subproblems, but we can simply subtract the overlapping from the total. Calculating the sum of the overlapping part requires a few of the simple observations required to calculate the problem with four corners fixed, so we concentrate on solving the following: for a given matrix size and values in its 4 corners, calculate its sum (modulo 109+7). It's important not to do modulo while calculating p or p', as that can ruin the calculation because we have inequalities in calculating shortest paths, and inequalities are not preserved under modulo operations. Notice that, if U is an upper bound for the given fixed values for cells, the longest path consists of at most 2 × U steps, so the highest value in the image of p is at most U + 2 × U × D, which is at most U + 2 × U2. With U up to 109, that value fits in a 64-bit signed integer, which means we don't need large integers to hold on taking the results modulo 109+7 until the summing part.
Let us call the matrix A and the 4 corners tl, tr, bl and br for top-left, top-right, bottom-left and bottom-right. As we argue in the Small dataset section, each cell's value is determined by one of the fixed cells, in this case, one of the four corners. Given the existence of the shortest path tree, the region of cells that are determined by each given corner (breaking ties by giving an arbitrary priority to corners) is contiguous. For a given corner x and cell c, let us call the influence of x over c i(x,c) to the fixed value x has plus S × D where S is the number of orthogonal steps between x and c. We call lower values of influence stronger. The corner that determines the value of a cell is therefore any of the ones with the strongest influence.
Now, consider the top row: tl has a stronger influence than tr over a left-most contiguous set of cells, and tr has a stronger influence over a right-most set of cells. There may be a single cell where the influence strength is equal. It is not hard to prove that the column at which the strongest influence switches from being tl to tr (if looking from left to right) is the same in this top row as in any other row, because the influence values from tl and tr for the i-th row (from top to bottom) are exactly the same values as the values in the top row plus i × D, so the most influential between tl and tr is always the same across cells in any given column. A similar thing happens with each pair of non-opposite corners. There are 4 such pairs. If we consider the lines that split the influence region of each of those pairs, we have up to 2 vertical and 2 horizontal lines (some of them may overlap), dividing A into up to 9 pieces. All except the middle piece have a single corner that has the most influence, thus they can be solved in the same way.
Consider a matrix of r rows and c columns with a single influential corner with value v. The sum of a row containing the value v is v + (v + D) + (v + 2 × D) + .... This summation can be calculated with a formula. And then, the sum of each other row is c × D larger than the previous one, as we add D to the sum for each column. Again, this yields a summation over a known linear function, which can be reduced by the same known formula.
The middle piece of A has influence from two opposite corners (which pair of opposites depends on the order of the lines). We can again partition A into up to 3 parts: rows with influence from one corner, rows with influence from the other corner, and rows with influence from both. Two of those can be summed with a similar formula as the single influential corner case. The rest is a rectangle partitioned into two stepped shaped pieces where the influence is divided. Those laddered pieces can be summed as the summation over a certain range of a quadratic function, which can also be reduced to a formula.
This finishes the problem. There are lots of technical details, specifically math details, that aren't covered in detail, but we hope this conveys the main ideas. We encourage you to fill in the gaps yourself and ask the community to help out if you can't, as it will be really good practice for your next contest.